org.apache.parquet.VersionParser$VersionParseException: Could not parse created_by: parquet-mr version 1.6.0 using format: (.+) version ((.*) )?\(build ?(.*)\)

Google Groups | Benjamin Angelaud | 4 months ago
  1. 0

    druid-parquet extension, SegmentDescriptorInfo is not found

    Google Groups | 4 months ago | Benjamin Angelaud
    org.apache.parquet.VersionParser$VersionParseException: Could not parse created_by: parquet-mr version 1.6.0 using format: (.+) version ((.*) )?\(build ?(.*)\)

    Root Cause Analysis

    1. org.apache.parquet.VersionParser$VersionParseException

      Could not parse created_by: parquet-mr version 1.6.0 using format: (.+) version ((.*) )?\(build ?(.*)\)

      at org.apache.parquet.VersionParser.parse()
    2. org.apache.parquet
      ParquetRecordReader.initialize
      1. org.apache.parquet.VersionParser.parse(VersionParser.java:112)
      2. org.apache.parquet.CorruptStatistics.shouldIgnoreStatistics(CorruptStatistics.java:60)
      3. org.apache.parquet.format.converter.ParquetMetadataConverter.fromParquetStatistics(ParquetMetadataConverter.java:263)
      4. org.apache.parquet.format.converter.ParquetMetadataConverter.fromParquetMetadata(ParquetMetadataConverter.java:567)
      5. org.apache.parquet.format.converter.ParquetMetadataConverter.readParquetMetadata(ParquetMetadataConverter.java:544)
      6. org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:431)
      7. org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:386)
      8. org.apache.parquet.hadoop.ParquetRecordReader.initializeInternalReader(ParquetRecordReader.java:162)
      9. org.apache.parquet.hadoop.ParquetRecordReader.initialize(ParquetRecordReader.java:145)
      9 frames
    3. Hadoop
      YarnChild$2.run
      1. org.apache.hadoop.mapreduce.lib.input.DelegatingRecordReader.initialize(DelegatingRecordReader.java:84)
      2. org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:557)
      3. org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:795)
      4. org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
      5. org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
      5 frames
    4. Java RT
      Subject.doAs
      1. java.security.AccessController.doPrivileged(Native Method)
      2. javax.security.auth.Subject.doAs(Subject.java:422)
      2 frames
    5. Hadoop
      UserGroupInformation.doAs
      1. org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
      1 frame
    6. Hadoop
      YarnChild.main
      1. org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
      1 frame