java.nio.charset.UnsupportedCharsetException: Cp943C

JIRA | Gordon Mohr | 10 years ago
  1. 0

    When an extractor/replaycharsequence cannot interpret a charset, the problem should be locally noted and logged, so that other processors will still be run normally, rather than generating a runtime exception relayed all the way to the catchall handler. Possibilities to note the problem: use a shorthand CrawlURI 'annotation'; use a 'local-errors' entry as with fetch-module connect/protocol problems.

    An example of the exception from a test crawl:

    20040529201501109 -5 13252 #43 http://www.sakigake.co.jp/servlet/SKNEWS.News.kiji?InputKIJICODE=20040505d 1875 text/html 3t LLLL http://dict.cre.jp/

    java.nio.charset.UnsupportedCharsetException: Cp943C
        at java.nio.charset.Charset.forName(Charset.java:430)
        at org.archive.io.ReplayCharSequenceFactory$MultiByteReplayCharSequence.decodeInMemory(ReplayCharSequenceFactory.java:809)
        at org.archive.io.ReplayCharSequenceFactory$MultiByteReplayCharSequence.decode(ReplayCharSequenceFactory.java:745)
        at org.archive.io.ReplayCharSequenceFactory$MultiByteReplayCharSequence.<init>(ReplayCharSequenceFactory.java:708)
        at org.archive.io.ReplayCharSequenceFactory.getReplayCharSequence(ReplayCharSequenceFactory.java:134)
        at org.archive.io.RecordingOutputStream.getReplayCharSequence(RecordingOutputStream.java:388)
        at org.archive.io.RecordingInputStream.getReplayCharSequence(RecordingInputStream.java:278)
        at org.archive.util.HttpRecorder.getReplayCharSequence(HttpRecorder.java:301)
        at org.archive.crawler.extractor.ExtractorHTML.innerProcess(ExtractorHTML.java:337)
        at org.archive.crawler.framework.Processor.process(Processor.java:106)
        at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:208)
        at org.archive.crawler.framework.ToeThread.run(ToeThread.java:138)

    JIRA | 10 years ago | Gordon Mohr
    java.nio.charset.UnsupportedCharsetException: Cp943C
  4. 0

    "charset=latin-1" is not properly detected

    GitHub | 6 years ago | Rico99
    java.nio.charset.UnsupportedCharsetException: LATIN-1
  5. 0

    DataUtil doesn't check if <meta charset> is a supported charset before continuing

    GitHub | 2 years ago | BalusC
    java.nio.charset.UnsupportedCharsetException: iso-8
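
The entries above share one failure mode: a charset name taken from a document or header is passed straight to `Charset.forName`, which throws when the name is unknown to the running JVM. A minimal sketch of a defensive lookup follows. The class name `SafeCharsets`, the method `resolve`, and the choice of fallback are assumptions made for illustration; none of them come from the projects listed here.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.UnsupportedCharsetException;

/** Hypothetical helper, not part of any library mentioned on this page. */
final class SafeCharsets {
    private SafeCharsets() {}

    /**
     * Resolves a declared charset name. Returns the fallback when the name
     * is null, syntactically illegal, or not supported by this JVM.
     */
    static Charset resolve(String declaredName, Charset fallback) {
        if (declaredName == null) {
            return fallback;
        }
        try {
            return Charset.forName(declaredName.trim());
        } catch (IllegalCharsetNameException | UnsupportedCharsetException e) {
            // Note the problem locally instead of letting it propagate.
            System.err.println("Unusable charset '" + declaredName
                    + "', falling back to " + fallback.name());
            return fallback;
        }
    }
}
```

A caller could then write `SafeCharsets.resolve(name, StandardCharsets.ISO_8859_1)`. Which fallback is appropriate depends on the content being decoded, and whether a given name such as Cp943C resolves depends on the charsets shipped with the JVM in use.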

    Root Cause Analysis

    1. java.nio.charset.UnsupportedCharsetException

      Cp943C

      at java.nio.charset.Charset.forName()
    2. Java RT
      Charset.forName
      1. java.nio.charset.Charset.forName(Charset.java:430)
      1 frame
    3. webarchive-commons
      HttpRecorder.getReplayCharSequence
      1. org.archive.io.ReplayCharSequenceFactory$MultiByteReplayCharSequence.decodeInMemory(ReplayCharSequenceFactory.java:809)
      2. org.archive.io.ReplayCharSequenceFactory$MultiByteReplayCharSequence.decode(ReplayCharSequenceFactory.java:745)
      3. org.archive.io.ReplayCharSequenceFactory$MultiByteReplayCharSequence.<init>(ReplayCharSequenceFactory.java:708)
      4. org.archive.io.ReplayCharSequenceFactory.getReplayCharSequence(ReplayCharSequenceFactory.java:134)
      5. org.archive.io.RecordingOutputStream.getReplayCharSequence(RecordingOutputStream.java:388)
      6. org.archive.io.RecordingInputStream.getReplayCharSequence(RecordingInputStream.java:278)
      7. org.archive.util.HttpRecorder.getReplayCharSequence(HttpRecorder.java:301)
      7 frames
    4. org.archive.crawler
      ToeThread.run
      1. org.archive.crawler.extractor.ExtractorHTML.innerProcess(ExtractorHTML.java:337)
      2. org.archive.crawler.framework.Processor.process(Processor.java:106)
      3. org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:208)
      4. org.archive.crawler.framework.ToeThread.run(ToeThread.java:138)
      4 frames
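
The trace shows the exception leaving `ExtractorHTML.innerProcess` and travelling up through `Processor.process` to `ToeThread`, which is the behaviour the first report asks to avoid. One way to keep the failure local, as a rough sketch only: the extractor catches the charset failure, records a short annotation on the item, and returns so that later processors still run. Every type below (`CrawlItem`, `ItemProcessor`, `TextExtractor`, `Chain`) is a hypothetical stand-in and does not reflect the crawler's actual classes or signatures.

```java
import java.nio.charset.Charset;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a crawled URI that can carry short annotations. */
final class CrawlItem {
    final String declaredCharset;
    final byte[] body;
    final List<String> annotations = new ArrayList<>();

    CrawlItem(String declaredCharset, byte[] body) {
        this.declaredCharset = declaredCharset;
        this.body = body;
    }
}

/** Hypothetical processor interface, not the crawler's real one. */
interface ItemProcessor {
    void process(CrawlItem item);
}

/** Decodes the body with the declared charset; notes failures locally. */
final class TextExtractor implements ItemProcessor {
    int decodedChars = 0;

    @Override
    public void process(CrawlItem item) {
        final Charset charset;
        try {
            charset = Charset.forName(item.declaredCharset);
        } catch (IllegalArgumentException e) {
            // Covers null, illegal, and unsupported names: both
            // IllegalCharsetNameException and UnsupportedCharsetException
            // are subclasses of IllegalArgumentException.
            item.annotations.add("unsupportedCharset:" + item.declaredCharset);
            return; // skip extraction for this item only
        }
        decodedChars += new String(item.body, charset).length();
    }
}

/** Runs each processor in order; an annotated failure does not stop the chain. */
final class Chain {
    private Chain() {}

    static void runAll(CrawlItem item, List<ItemProcessor> processors) {
        for (ItemProcessor processor : processors) {
            processor.process(item);
        }
    }
}
```

With this shape, an item that declares an unknown charset ends up with an annotation that can be logged or reported later, while the remaining processors in the list are still invoked for it.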