java.io.EOFException

JIRA | Aaron Binns | 6 years ago

    Indexing the LoC E04 collection, I ran into a 0-byte arc.gz file:

    http://locdata950.us.archive.org:19972/0/E04-CRAWL-50-20040918065656-00870-crawling001.archive.org.arc.gz

    The Java gzip library doesn't like it (see exception trace at bottom) and the ARCReaderFactory bails out. Hard to say if this is:

    * bug in ARCReader(Factory) code
    * bug in our repository to allow a 0-byte arc.gz file to be ingested
    * bug in my code for not handling this exception

    java.io.EOFException
        at java.util.zip.GZIPInputStream.readUByte(GZIPInputStream.java:207)
        at java.util.zip.GZIPInputStream.readUShort(GZIPInputStream.java:197)
        at java.util.zip.GZIPInputStream.readHeader(GZIPInputStream.java:136)
        at java.util.zip.GZIPInputStream.<init>(GZIPInputStream.java:58)
        at org.archive.io.GzippedInputStream.<init>(GzippedInputStream.java:103)
        at org.archive.io.GzippedInputStream.<init>(GzippedInputStream.java:90)
        at org.archive.io.arc.ARCReaderFactory$CompressedARCReader.<init>(ARCReaderFactory.java:367)
        at org.archive.io.arc.ARCReaderFactory.getArchiveReader(ARCReaderFactory.java:140)
        at org.archive.io.arc.ARCReaderFactory.get(ARCReaderFactory.java:131)
        at org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:181)
        at org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:217)
        at org.archive.io.ArchiveReaderFactory.get(ArchiveReaderFactory.java:200)
        at org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:99)
        at org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:93)
        at org.archive.io.ArchiveReaderFactory.get(ArchiveReaderFactory.java:88)
        at org.archive.nutchwax.Importer.map(Importer.java:196)
        at org.archive.nutchwax.Importer.map(Importer.java:98)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
        at org.apache.hadoop.mapred.Child.main(Child.java:170)
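
If the third possibility in the report applies (the calling code not handling the exception), one option is to check the file before handing it to the reader. The following is only a sketch using plain `java.util.zip` and `java.io` classes. The class `GzipGuard` and the method `isReadableGzip` are hypothetical names introduced here for illustration; they are not part of ARCReaderFactory, NutchWAX, or Hadoop, and the sketch makes no claim about how those projects resolved the issue.

```java
import java.io.EOFException;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.GZIPInputStream;

// Hypothetical helper, not part of any library named in the report.
final class GzipGuard {

    /**
     * Returns false for a 0-byte file or a file that ends before the
     * gzip header is complete; returns true if the header can be read.
     * Other I/O problems (for example a non-gzip header) still propagate
     * as IOException so they are not silently hidden.
     */
    static boolean isReadableGzip(File file) throws IOException {
        if (file.length() == 0) {
            return false; // the 0-byte case from the report
        }
        try (InputStream raw = new FileInputStream(file);
             InputStream gz = new GZIPInputStream(raw)) {
            return true; // the constructor managed to read the header
        } catch (EOFException e) {
            return false; // file ended inside the gzip header
        }
    }
}
```

A caller such as a map task could use this check to log and skip an empty archive instead of failing, assuming that skipping is acceptable for the job at hand.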

    Root Cause Analysis

    1. java.io.EOFException

      No message provided

      at java.util.zip.GZIPInputStream.readUByte()
    2. Java RT
      GZIPInputStream.<init>
      1. java.util.zip.GZIPInputStream.readUByte(GZIPInputStream.java:207)
      2. java.util.zip.GZIPInputStream.readUShort(GZIPInputStream.java:197)
      3. java.util.zip.GZIPInputStream.readHeader(GZIPInputStream.java:136)
      4. java.util.zip.GZIPInputStream.<init>(GZIPInputStream.java:58)
      4 frames
    3. webarchive-commons
      ArchiveReaderFactory.get
      1. org.archive.io.GzippedInputStream.<init>(GzippedInputStream.java:103)
      2. org.archive.io.GzippedInputStream.<init>(GzippedInputStream.java:90)
      3. org.archive.io.arc.ARCReaderFactory$CompressedARCReader.<init>(ARCReaderFactory.java:367)
      4. org.archive.io.arc.ARCReaderFactory.getArchiveReader(ARCReaderFactory.java:140)
      5. org.archive.io.arc.ARCReaderFactory.get(ARCReaderFactory.java:131)
      6. org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:181)
      7. org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:217)
      8. org.archive.io.ArchiveReaderFactory.get(ArchiveReaderFactory.java:200)
      9. org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:99)
      10. org.archive.io.ArchiveReaderFactory.getArchiveReader(ArchiveReaderFactory.java:93)
      11. org.archive.io.ArchiveReaderFactory.get(ArchiveReaderFactory.java:88)
      11 frames
    4. org.archive.nutchwax
      Importer.map
      1. org.archive.nutchwax.Importer.map(Importer.java:196)
      2. org.archive.nutchwax.Importer.map(Importer.java:98)
      2 frames
    5. Hadoop
      Child.main
      1. org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
      2. org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
      3. org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
      4. org.apache.hadoop.mapred.Child.main(Child.java:170)
      4 frames
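
The innermost frames show the exception being raised while the `GZIPInputStream` constructor reads the gzip header. A minimal sketch that reproduces this with an empty in-memory stream standing in for the 0-byte file is shown here. The class name `EmptyGzipRepro` and the method `openEmptyGzip` are hypothetical and exist only for this illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.util.zip.GZIPInputStream;

// Hypothetical reproduction class; an empty byte array stands in for the 0-byte file.
final class EmptyGzipRepro {

    /** Tries to open an empty stream as gzip and reports what happened. */
    static String openEmptyGzip() throws IOException {
        try {
            // The constructor reads the gzip header immediately,
            // so an empty stream fails before any data is requested.
            new GZIPInputStream(new ByteArrayInputStream(new byte[0])).close();
            return "opened";
        } catch (EOFException e) {
            return "EOFException";
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(openEmptyGzip()); // prints "EOFException"
    }
}
```

Because the failure happens in the constructor, any wrapper that builds a `GZIPInputStream` eagerly will surface the same exception before a single record is read.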