java.lang.NoClassDefFoundError: org/apache/commons/io/IOUtils

blogjava.net | 1 week ago
  1. 0

    I have encountered two types of error due to editing configuration of running crawls that have TimespanCriteria refinements. The first appears in the errors log, the second from the edit configuration UI. Each of my crawls had two refinements that operate on disjoint times of the day to simply change the min-delay. The full exceptions for each are below. HOW TO RECREATE --------------- Create a crawl with two TimespanCriteria refinements, one from 1030-1600 and the other from 1601-2000 that change only the min-delay. Start the crawl, then go to "Jobs -> Edit Configuration". Change anything, and click "Finished". Wait while for a while and watch the logs. In the errors log: 2006-07-19T21:56:21.845Z -5 65145 http://www.someserver.com/page2 RLLL http://www.someserver.com/page1 text/html #037 20060719215120589+301214 Z6BXQIEJEWV7F3FDZ2W6CFNZ7B7UUUYI http://www.someserver.com/ timeTrunc,err=java.lang.NumberFormatException java.lang.NumberFormatException: For input string: "" at java.lang.NumberFormatException.forInputString(NumberFormatException.java:4 8) at java.lang.Long.parseLong(Long.java:424) at java.lang.Long.parseLong(Long.java:461) at java.text.DigitList.getLong(DigitList.java:167) at java.text.DecimalFormat.parse(DecimalFormat.java:1271) at java.text.SimpleDateFormat.subParse(SimpleDateFormat.java:1690) at java.text.SimpleDateFormat.parse(SimpleDateFormat.java:1124) at java.text.DateFormat.parse(DateFormat.java:333) at org.archive.crawler.settings.refinements.TimespanCriteria.isWithinRefinemen tBounds(TimespanCriteria.java:73) at org.archive.crawler.settings.refinements.Refinement.isWithinRefinementBound s(Refinement.java:90) at org.archive.crawler.settings.SettingsHandler.getRefinementsForSettings(Sett ingsHandler.java:296) at org.archive.crawler.settings.SettingsHandler.getSettings(SettingsHandler.ja va:271) at org.archive.crawler.settings.ComplexType.getSettingsFromObject(ComplexType. java:331) at org.archive.crawler.settings.ComplexType.getSettingsFromObject(ComplexType. java:371) at org.archive.crawler.settings.ComplexType.getAttribute(ComplexType.java:449) at org.archive.crawler.settings.ComplexType.getAttribute(ComplexType.java:429) at org.archive.crawler.framework.Processor.process(Processor.java:95) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:153) Or, from the edit configuration UI: java.lang.ArrayIndexOutOfBoundsException: 437 at sun.util.calendar.BaseCalendar.getCalendarDateFromFixedDate(BaseCalendar.ja va:436) at java.util.GregorianCalendar.computeFields(GregorianCalendar.java:2080) at java.util.GregorianCalendar.computeTime(GregorianCalendar.java:2471) at java.util.Calendar.updateTime(Calendar.java:2260) at java.util.Calendar.complete(Calendar.java:1305) at java.util.Calendar.get(Calendar.java:1088) at java.text.SimpleDateFormat.subFormat(SimpleDateFormat.java:899) at java.text.SimpleDateFormat.format(SimpleDateFormat.java:806) at java.text.SimpleDateFormat.format(SimpleDateFormat.java:778) at java.text.DateFormat.format(DateFormat.java:314) at org.archive.crawler.settings.refinements.TimespanCriteria.isWithinRefinemen tBounds(TimespanCriteria.java:73) at org.archive.crawler.settings.refinements.Refinement.isWithinRefinementBound s(Refinement.java:90) at org.archive.crawler.settings.SettingsHandler.getRefinementsForSettings(Sett ingsHandler.java:296) at org.archive.crawler.settings.SettingsHandler.getSettings(SettingsHandler.ja va:271) at org.archive.crawler.settings.ComplexType.getSettingsFromObject(ComplexType. java:331) at org.archive.crawler.settings.ComplexType.getSettingsFromObject(ComplexType. java:371) at org.archive.crawler.settings.ComplexType.getAttribute(ComplexType.java:449) at org.archive.crawler.url.canonicalize.BaseRule.isEnabled(BaseRule.java:68) at org.archive.crawler.url.Canonicalizer.canonicalize(Canonicalizer.java:90) at org.archive.crawler.url.Canonicalizer.canonicalize(Canonicalizer.java:65) at org.archive.crawler.frontier.AbstractFrontier.canonicalize(AbstractFrontier .java:969) at org.archive.crawler.frontier.AbstractFrontier.canonicalize(AbstractFrontier .java:988) at org.archive.crawler.frontier.WorkQueueFrontier.schedule(WorkQueueFrontier.j ava:363) at org.archive.crawler.frontier.AbstractFrontier.loadSeeds(AbstractFrontier.ja va:550) at org.archive.crawler.framework.CrawlScope.kickUpdate(CrawlScope.java:244) at org.archive.crawler.deciderules.DecidingScope.kickUpdate(DecidingScope.java :76) at org.archive.crawler.framework.CrawlController.kickUpdate(CrawlController.ja va:1630) at org.archive.crawler.admin.CrawlJob.kickUpdate(CrawlJob.java:1948) at org.archive.crawler.admin.CrawlJobHandler.kickUpdate(CrawlJobHandler.java:1 154) at org.archive.crawler.jspc.admin.jobs.configure_jsp._jspService(Unknown Source) at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137) at javax.servlet.http.HttpServlet.service(HttpServlet.java:853) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:358) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicati onHandler.java:342) at org.archive.crawler.admin.ui.RootFilter.doFilter(RootFilter.java:67)

    JIRA | 10 years ago | (sourceforge)
    java.lang.ArrayIndexOutOfBoundsException: 437
  2. 0

    Igor sees this on crawling006. When I try to create a new job based on an existing job I get FatalConfigurationException. An error occured org.archive.crawler.framework.exceptions.FatalConfigurationException: IOException occured while writing new settings files for new job/profile Invalid argument org.archive.crawler.framework.exceptions.FatalConfigurationException: IOException occured while writing new settings files for new job/profile Invalid argument at org.archive.crawler.admin.CrawlJobHandler.makeNew(CrawlJobHandler.java:834) at org.archive.crawler.admin.CrawlJobHandler.newJob(CrawlJobHandler.java:706) at org.archive.crawler.jspc.admin.jobs.new_jsp._jspService(Unknown Source) at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137) at javax.servlet.http.HttpServlet.service(HttpServlet.java:853) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:358) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicati onHandler.java:342) at org.archive.crawler.admin.ui.RootFilter.doFilter(RootFilter.java:67) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicati onHandler.java:334) at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHand ler.java:286) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567) at org.mortbay.http.HttpContext.handle(HttpContext.java:1807) at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContex t.java:525) at org.mortbay.http.HttpContext.handle(HttpContext.java:1757) at org.mortbay.http.HttpServer.service(HttpServer.java:879) at org.mortbay.http.HttpConnection.service(HttpConnection.java:789) at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:960) at org.mortbay.http.HttpConnection.handle(HttpConnection.java:806) at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:218) at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:300) at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:511)

    JIRA | 10 years ago | Michael Stack
    org.archive.crawler.framework.exceptions.FatalConfigurationException: IOException occured while writing new settings files for new job/profile Invalid argument
  3. Speed up your debug routine!

    Automated exception search integrated into your IDE

    12 unregistered visitors
    Not finding the right solution?
    Take a tour to get the most out of Samebug.

    Tired of useless tips?

    Automated exception search integrated into your IDE

    Root Cause Analysis

    1. java.lang.ClassNotFoundException

      org.apache.commons.io.IOUtils

      at java.net.URLClassLoader$1.run()
    2. Java RT
      ClassLoader.loadClassInternal
      1. java.net.URLClassLoader$1.run(URLClassLoader.java:200)
      2. java.security.AccessController.doPrivileged(Native Method)
      3. java.net.URLClassLoader.findClass(URLClassLoader.java:188)
      4. java.lang.ClassLoader.loadClass(ClassLoader.java:307)
      5. sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
      6. java.lang.ClassLoader.loadClass(ClassLoader.java:252)
      7. java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320)
      7 frames
    3. org.archive.crawler
      CrawlJobHandler.newJob
      1. org.archive.crawler.settings.XMLSettingsHandler.writeSettingsObject(XMLSettingsHandler.java:260)
      2. org.archive.crawler.settings.XMLSettingsHandler.writeSettingsObject(XMLSettingsHandler.java:208)
      3. org.archive.crawler.settings.XMLSettingsHandler.copySettings(XMLSettingsHandler.java:360)
      4. org.archive.crawler.admin.CrawlJobHandler.createSettingsHandler(CrawlJobHandler.java:889)
      5. org.archive.crawler.admin.CrawlJobHandler.createNewJob(CrawlJobHandler.java:781)
      6. org.archive.crawler.admin.CrawlJobHandler.newJob(CrawlJobHandler.java:730)
      6 frames
    4. org.apache.jsp
      new_jsp._jspService
      1. org.apache.jsp.new_jsp._jspService(new_jsp.java:185)
      1 frame
    5. GWT dev
      HttpJspBase.service
      1. org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137)
      1 frame
    6. JavaServlet
      HttpServlet.service
      1. javax.servlet.http.HttpServlet.service(HttpServlet.java:853)
      1 frame
    7. GWT dev
      JspServlet.service
      1. org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:210)
      2. org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:295)
      3. org.apache.jasper.servlet.JspServlet.service(JspServlet.java:241)
      3 frames
    8. JavaServlet
      HttpServlet.service
      1. javax.servlet.http.HttpServlet.service(HttpServlet.java:853)
      1 frame
    9. Jetty Server
      WebApplicationHandler$Chain.doFilter
      1. org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:358)
      2. org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:342)
      2 frames
    10. org.archive.crawler
      RootFilter.doFilter
      1. org.archive.crawler.admin.ui.RootFilter.doFilter(RootFilter.java:67)
      1 frame
    11. Jetty Server
      ServletHandler.handle
      1. org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:334)
      2. org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:286)
      3. org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
      3 frames
    12. org.mortbay.http
      HttpContext.handle
      1. org.mortbay.http.HttpContext.handle(HttpContext.java:1807)
      1 frame
    13. Jetty Server
      WebApplicationContext.handle
      1. org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:525)
      1 frame
    14. org.mortbay.http
      SocketListener.handleConnection
      1. org.mortbay.http.HttpContext.handle(HttpContext.java:1757)
      2. org.mortbay.http.HttpServer.service(HttpServer.java:879)
      3. org.mortbay.http.HttpConnection.service(HttpConnection.java:789)
      4. org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:960)
      5. org.mortbay.http.HttpConnection.handle(HttpConnection.java:806)
      6. org.mortbay.http.SocketListener.handleConnection(SocketListener.java:218)
      6 frames
    15. Jetty Util
      ThreadPool$PoolThread.run
      1. org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:300)
      2. org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:511)
      2 frames