org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 1.0 failed 4 times, most recent failure: Lost task 0.3 in stage 1.0 (TID 5, hdp-node4.affinytix.com): java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 13

Stack Overflow | sam | 2 months ago
tip
Click on the to mark the solution that helps you, Samebug will learn from it.
As a community member, you’ll be rewarded for you help.
  1. 0

    Parquet build with HDFS getmerge recovery

    Stack Overflow | 2 months ago | sam
    org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 1.0 failed 4 times, most recent failure: Lost task 0.3 in stage 1.0 (TID 5, hdp-node4.affinytix.com): java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 13
  2. 0

    Apache Spark Unit Test not running in command line but runs in eclipse

    Stack Overflow | 2 days ago | Rahul
    org.apache.spark.SparkException: Job aborted due to stage failure: Task 2 in stage 13.0 failed 1 times, most recent failure: Lost task 2.0 in stage 13.0 (TID 55, localhost): java.lang.NoClassDefFoundError: parquet/common/schema/ColumnPath
  3. 0

    Cannot read PageHeader error (null)

    GitHub | 3 years ago | doug-explorys
    java.io.IOException: can not read class parquet.format.PageHeader: null
  4. Speed up your debug routine!

    Automated exception search integrated into your IDE

  5. 0

    Parquet-Avro table with case sensitive fields in hive

    Google Groups | 3 years ago | ckoz...@eng.ucsd.edu
    parquet.format.PageHeader: null
  6. 0

    Warnings trying to read Spark 1.6.X Parquet into Spark 2.X

    Stack Overflow | 1 week ago | javadba
    org.apache.parquet.VersionParser$VersionParseException: Could not parse created_by: parquet-mr version 1.6.0 using format: (.+) version ((.*) )?\(build ?(.*)\)

    Not finding the right solution?
    Take a tour to get the most out of Samebug.

    Tired of useless tips?

    Automated exception search integrated into your IDE

    Root Cause Analysis

    1. parquet.org.apache.thrift.protocol.TProtocolException

      don't know what type: 13

      at parquet.org.apache.thrift.protocol.TCompactProtocol.getTType()
    2. Parquet Format
      TCompactProtocol.readFieldBegin
      1. parquet.org.apache.thrift.protocol.TCompactProtocol.getTType(TCompactProtocol.java:806)
      2. parquet.org.apache.thrift.protocol.TCompactProtocol.readFieldBegin(TCompactProtocol.java:500)
      2 frames
    3. org.apache.parquet
      ParquetFileReader.readNextRowGroup
      1. org.apache.parquet.format.InterningProtocol.readFieldBegin(InterningProtocol.java:158)
      2. org.apache.parquet.format.PageHeader.read(PageHeader.java:828)
      3. org.apache.parquet.format.Util.read(Util.java:213)
      4. org.apache.parquet.format.Util.readPageHeader(Util.java:65)
      5. org.apache.parquet.hadoop.ParquetFileReader$WorkaroundChunk.readPageHeader(ParquetFileReader.java:668)
      6. org.apache.parquet.hadoop.ParquetFileReader$Chunk.readAllPages(ParquetFileReader.java:546)
      7. org.apache.parquet.hadoop.ParquetFileReader.readNextRowGroup(ParquetFileReader.java:496)
      7 frames
    4. org.apache.spark
      UnsafeRowParquetRecordReader.nextKeyValue
      1. org.apache.spark.sql.execution.datasources.parquet.UnsafeRowParquetRecordReader.checkEndOfRowGroup(UnsafeRowParquetRecordReader.java:604)
      2. org.apache.spark.sql.execution.datasources.parquet.UnsafeRowParquetRecordReader.loadBatch(UnsafeRowParquetRecordReader.java:218)
      3. org.apache.spark.sql.execution.datasources.parquet.UnsafeRowParquetRecordReader.nextKeyValue(UnsafeRowParquetRecordReader.java:196)
      3 frames
    5. Spark
      SqlNewHadoopRDD$$anon$1.hasNext
      1. org.apache.spark.rdd.SqlNewHadoopRDD$$anon$1.hasNext(SqlNewHadoopRDD.scala:194)
      1 frame
    6. Scala
      AbstractIterator.toArray
      1. scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:327)
      2. scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:327)
      3. scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:308)
      4. scala.collection.Iterator$class.foreach(Iterator.scala:727)
      5. scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
      6. scala.collection.generic.Growable$class.$plus$plus$eq(Growable.scala:48)
      7. scala.collection.mutable.ArrayBuffer.$plus$plus$eq(ArrayBuffer.scala:103)
      8. scala.collection.mutable.ArrayBuffer.$plus$plus$eq(ArrayBuffer.scala:47)
      9. scala.collection.TraversableOnce$class.to(TraversableOnce.scala:273)
      10. scala.collection.AbstractIterator.to(Iterator.scala:1157)
      11. scala.collection.TraversableOnce$class.toBuffer(TraversableOnce.scala:265)
      12. scala.collection.AbstractIterator.toBuffer(Iterator.scala:1157)
      13. scala.collection.TraversableOnce$class.toArray(TraversableOnce.scala:252)
      14. scala.collection.AbstractIterator.toArray(Iterator.scala:1157)
      14 frames
    7. Spark Project SQL
      SparkPlan$$anonfun$5.apply
      1. org.apache.spark.sql.execution.SparkPlan$$anonfun$5.apply(SparkPlan.scala:212)
      2. org.apache.spark.sql.execution.SparkPlan$$anonfun$5.apply(SparkPlan.scala:212)
      2 frames
    8. Spark
      Executor$TaskRunner.run
      1. org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:1881)
      2. org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:1881)
      3. org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:66)
      4. org.apache.spark.scheduler.Task.run(Task.scala:89)
      5. org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214)
      5 frames
    9. Java RT
      Thread.run
      1. java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
      2. java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
      3. java.lang.Thread.run(Thread.java:745)
      3 frames