java.io.IOException: Could not read footer: java.lang.NoClassDefFoundError: parquet/org/codehaus/jackson/JsonGenerationException
    at parquet.hadoop.ParquetFileReader.readAllFootersInParallel(ParquetFileReader.java:189)
    at parquet.hadoop.ParquetFileReader.readAllFootersInParallelUsingSummaryFiles(ParquetFileReader.java:145)
    at parquet.hadoop.ParquetInputFormat.getFooters(ParquetInputFormat.java:354)
    at parquet.hadoop.ParquetInputFormat.getFooters(ParquetInputFormat.java:339)
    at parquet.hadoop.ParquetInputFormat.getSplits(ParquetInputFormat.java:246)
    at org.apache.spark.rdd.NewHadoopRDD.getPartitions(NewHadoopRDD.scala:85)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:207)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:205)

Google Groups | Uri Laserson | 3 years ago

    Re: Using Parquet from an interactive Spark shell

    Root Cause Analysis

    1. java.io.IOException

      Could not read footer: java.lang.NoClassDefFoundError: parquet/org/codehaus/jackson/JsonGenerationException
        at parquet.hadoop.ParquetFileReader.readAllFootersInParallel(ParquetFileReader.java:189)
        at parquet.hadoop.ParquetFileReader.readAllFootersInParallelUsingSummaryFiles(ParquetFileReader.java:145)
        at parquet.hadoop.ParquetInputFormat.getFooters(ParquetInputFormat.java:354)
        at parquet.hadoop.ParquetInputFormat.getFooters(ParquetInputFormat.java:339)
        at parquet.hadoop.ParquetInputFormat.getSplits(ParquetInputFormat.java:246)
        at org.apache.spark.rdd.NewHadoopRDD.getPartitions(NewHadoopRDD.scala:85)
        at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:207)
        at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:205)

      at scala.Option.getOrElse()
    2. Scala
      Option.getOrElse
      1. scala.Option.getOrElse(Option.scala:120)
      1 frame
    3. Spark
      RDD.collect
      1. org.apache.spark.rdd.RDD.partitions(RDD.scala:205)
      2. org.apache.spark.SparkContext.runJob(SparkContext.scala:863)
      3. org.apache.spark.rdd.RDD.collect(RDD.scala:602)
      3 frames
    4. Unknown
      $iwC.<init>
      1. $iwC$$iwC$$iwC$$iwC.<init>(<console>:20)
      2. $iwC$$iwC$$iwC.<init>(<console>:25)
      3. $iwC$$iwC.<init>(<console>:27)
      4. $iwC.<init>(<console>:29)
      4 frames