gnu.inet.encoding.IDNAException: String too long. .online.fr

JIRA | Gordon Mohr | 1 decade ago
tip
Click on the to mark the solution that helps you, Samebug will learn from it.
As a community member, you’ll be rewarded for you help.
  1. 0

    Seen in stdout of test crawl, early in a crawl that was recovering from a recover log: org.apache.commons.httpclient.URIException: gnu.inet.encoding.IDNAException: String too long. .online.fr at org.archive.net.UURIFactory.fixupDomainlabel(UURIFactory.java:622) at org.archive.net.UURIFactory.fixupAuthority(UURIFactory.java:572) at org.archive.net.UURIFactory.fixup(UURIFactory.java:475) at org.archive.net.UURIFactory.create(UURIFactory.java:319) at org.archive.net.UURIFactory.create(UURIFactory.java:309) at org.archive.net.UURIFactory.getInstance(UURIFactory.java:262) at org.archive.util.SurtPrefixSet.prefixFromPlain(SurtPrefixSet.java:213) at org.archive.util.SurtPrefixSet.addFromPlain(SurtPrefixSet.java:193) at org.archive.util.SurtPrefixSet.importFromMixed(SurtPrefixSet.java:180) at org.archive.crawler.deciderules.SurtPrefixedDecideRule.buildSurtPrefixSet(S urtPrefixedDecideRule.java:254) at org.archive.crawler.deciderules.SurtPrefixedDecideRule.readPrefixes(SurtPre fixedDecideRule.java:186) at org.archive.crawler.deciderules.SurtPrefixedDecideRule.getPrefixes(SurtPref ixedDecideRule.java:180) at org.archive.crawler.deciderules.SurtPrefixedDecideRule.evaluate(SurtPrefixe dDecideRule.java:150) at org.archive.crawler.deciderules.PredicatedDecideRule.decisionFor(Predicated DecideRule.java:41) at org.archive.crawler.deciderules.DecideRuleSequence.decisionFor(DecideRuleSe quence.java:70) at org.archive.crawler.deciderules.DecidingScope.innerAccepts(DecidingScope.ja va:68) at org.archive.crawler.framework.Filter.accepts(Filter.java:90) at org.archive.crawler.framework.Scoper.isInScope(Scoper.java:115) at org.archive.crawler.prefetch.Preselector.innerProcess(Preselector.java:115) at org.archive.crawler.framework.Processor.process(Processor.java:103) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:153)

    JIRA | 1 decade ago | Gordon Mohr
    gnu.inet.encoding.IDNAException: String too long. .online.fr

    Root Cause Analysis

    1. gnu.inet.encoding.IDNAException

      String too long. .online.fr

      at org.archive.net.UURIFactory.fixupDomainlabel()
    2. webarchive-commons
      SurtPrefixSet.importFromMixed
      1. org.archive.net.UURIFactory.fixupDomainlabel(UURIFactory.java:622)
      2. org.archive.net.UURIFactory.fixupAuthority(UURIFactory.java:572)
      3. org.archive.net.UURIFactory.fixup(UURIFactory.java:475)
      4. org.archive.net.UURIFactory.create(UURIFactory.java:319)
      5. org.archive.net.UURIFactory.create(UURIFactory.java:309)
      6. org.archive.net.UURIFactory.getInstance(UURIFactory.java:262)
      7. org.archive.util.SurtPrefixSet.prefixFromPlain(SurtPrefixSet.java:213)
      8. org.archive.util.SurtPrefixSet.addFromPlain(SurtPrefixSet.java:193)
      9. org.archive.util.SurtPrefixSet.importFromMixed(SurtPrefixSet.java:180)
      9 frames
    3. org.archive.crawler
      ToeThread.run
      1. org.archive.crawler.deciderules.SurtPrefixedDecideRule.buildSurtPrefixSet(SurtPrefixedDecideRule.java:254)
      2. org.archive.crawler.deciderules.SurtPrefixedDecideRule.readPrefixes(SurtPrefixedDecideRule.java:186)
      3. org.archive.crawler.deciderules.SurtPrefixedDecideRule.getPrefixes(SurtPrefixedDecideRule.java:180)
      4. org.archive.crawler.deciderules.SurtPrefixedDecideRule.evaluate(SurtPrefixedDecideRule.java:150)
      5. org.archive.crawler.deciderules.PredicatedDecideRule.decisionFor(PredicatedDecideRule.java:41)
      6. org.archive.crawler.deciderules.DecideRuleSequence.decisionFor(DecideRuleSequence.java:70)
      7. org.archive.crawler.deciderules.DecidingScope.innerAccepts(DecidingScope.java:68)
      8. org.archive.crawler.framework.Filter.accepts(Filter.java:90)
      9. org.archive.crawler.framework.Scoper.isInScope(Scoper.java:115)
      10. org.archive.crawler.prefetch.Preselector.innerProcess(Preselector.java:115)
      11. org.archive.crawler.framework.Processor.process(Processor.java:103)
      12. org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306)
      13. org.archive.crawler.framework.ToeThread.run(ToeThread.java:153)
      13 frames