org.apache.spark.SparkException: Job aborted due to stage failure: Task 10 in stage 2023.0 failed 4 times, most recent failure: Lost task 10.3 in stage 2023.0 (TID 17158, ip-172-31-12-157.us-west-2.compute.internal): java.lang.ClassCastException: optional binary element (UTF8) is not a group

Stack Overflow | user2849678 | 7 months ago
  1.

    Strange error on EMR - Spark, while using Parquet

    Root Cause Analysis

    1. org.apache.spark.SparkException

      Job aborted due to stage failure: Task 10 in stage 2023.0 failed 4 times, most recent failure: Lost task 10.3 in stage 2023.0 (TID 17158, ip-172-31-12-157.us-west-2.compute.internal): java.lang.ClassCastException: optional binary element (UTF8) is not a group

    2. org.apache.parquet
      1. org.apache.parquet.schema.Type.asGroupType(Type.java:202)
    3. org.apache.spark
      1. org.apache.spark.sql.execution.datasources.parquet.ParquetReadSupport$.org$apache$spark$sql$execution$datasources$parquet$ParquetReadSupport$$clipParquetType(ParquetReadSupport.scala:131)