com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory)

JIRA | Gordon Mohr | 9 years ago
  1. 0

    After using the web ui job 'delete' link, this appeared on STDOUT: Oct 9, 2007 8:51:06 PM org.archive.settings.file.BdbModule close2 SEVERE: Error closing environment. com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory) at com.sleepycat.je.log.FileManager$LogEndFileDescriptor.getWritableFile(FileManager.java:1655) at com.sleepycat.je.log.FileManager.writeLogBuffer(FileManager.java:908) at com.sleepycat.je.log.LogBufferPool.writeBufferToFile(LogBufferPool.java:219) at com.sleepycat.je.log.LogBufferPool.writeCompleted(LogBufferPool.java:275) at com.sleepycat.je.log.LogManager.logInternal(LogManager.java:507) at com.sleepycat.je.log.SyncedLogManager.logItem(SyncedLogManager.java:51) at com.sleepycat.je.log.LogManager.log(LogManager.java:257) at com.sleepycat.je.log.LogManager.logForceFlush(LogManager.java:129) at com.sleepycat.je.recovery.Checkpointer.doCheckpoint(Checkpointer.java:479) at com.sleepycat.je.dbi.EnvironmentImpl.invokeCheckpoint(EnvironmentImpl.java:1158) at com.sleepycat.je.Environment.sync(Environment.java:859) at org.archive.settings.file.BdbModule.close2(BdbModule.java:570) at org.archive.settings.file.BdbModule.close(BdbModule.java:541) at org.archive.settings.file.FileSheetManager.offlineCleanup(FileSheetManager.java:949) at org.archive.settings.jmx.JMXSheetManagerImpl.offlineCleanup(JMXSheetManagerImpl.java:555) at org.archive.settings.jmx.JMXSheetManagerImpl$ReapTask.run(JMXSheetManagerImpl.java:594) at java.util.TimerThread.mainLoop(Timer.java:512) at java.util.TimerThread.run(Timer.java:462) Caused by: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory) at com.sleepycat.je.log.FileManager.makeFileHandle(FileManager.java:679) at com.sleepycat.je.log.FileManager.access$500(FileManager.java:47) at com.sleepycat.je.log.FileManager$LogEndFileDescriptor.getWritableFile(FileManager.java:1638) ... 17 more

    JIRA | 9 years ago | Gordon Mohr
    com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory)
  2. 0

    After using the web ui job 'delete' link, this appeared on STDOUT: Oct 9, 2007 8:51:06 PM org.archive.settings.file.BdbModule close2 SEVERE: Error closing environment. com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory) at com.sleepycat.je.log.FileManager$LogEndFileDescriptor.getWritableFile(FileManager.java:1655) at com.sleepycat.je.log.FileManager.writeLogBuffer(FileManager.java:908) at com.sleepycat.je.log.LogBufferPool.writeBufferToFile(LogBufferPool.java:219) at com.sleepycat.je.log.LogBufferPool.writeCompleted(LogBufferPool.java:275) at com.sleepycat.je.log.LogManager.logInternal(LogManager.java:507) at com.sleepycat.je.log.SyncedLogManager.logItem(SyncedLogManager.java:51) at com.sleepycat.je.log.LogManager.log(LogManager.java:257) at com.sleepycat.je.log.LogManager.logForceFlush(LogManager.java:129) at com.sleepycat.je.recovery.Checkpointer.doCheckpoint(Checkpointer.java:479) at com.sleepycat.je.dbi.EnvironmentImpl.invokeCheckpoint(EnvironmentImpl.java:1158) at com.sleepycat.je.Environment.sync(Environment.java:859) at org.archive.settings.file.BdbModule.close2(BdbModule.java:570) at org.archive.settings.file.BdbModule.close(BdbModule.java:541) at org.archive.settings.file.FileSheetManager.offlineCleanup(FileSheetManager.java:949) at org.archive.settings.jmx.JMXSheetManagerImpl.offlineCleanup(JMXSheetManagerImpl.java:555) at org.archive.settings.jmx.JMXSheetManagerImpl$ReapTask.run(JMXSheetManagerImpl.java:594) at java.util.TimerThread.mainLoop(Timer.java:512) at java.util.TimerThread.run(Timer.java:462) Caused by: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory) at com.sleepycat.je.log.FileManager.makeFileHandle(FileManager.java:679) at com.sleepycat.je.log.FileManager.access$500(FileManager.java:47) at com.sleepycat.je.log.FileManager$LogEndFileDescriptor.getWritableFile(FileManager.java:1638) ... 17 more

    JIRA | 9 years ago | Gordon Mohr
    com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory)
  3. 0

    I started heritrix 1.14.4 on Friday morning to crawl only text/html files, beginning with yahoo.com On Monday morning i saw that heritrix just stopped crawling, heap space looks fine. The log says (last entry): 2010-12-10T18:04:49.904Z -5 - http://kr.promotion.yahoo.com/worldcup2010/ RRLLLLLLLE http://kr.sports.yahoo.com/event/wc2010/ no-type #020 - - - err=com.sleepycat.util.RuntimeExceptionWrapper com.sleepycat.util.RuntimeExceptionWrapper: (JE 3.3.82) fetchTarget of 0x50/0x8590e3 parent IN=266461 IN class=com.sleepycat.je.tree.BIN lastFullVersion=0x77/0x6dcaa0 parent.getDirty()=true state=0 com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.82) 0x50/0x8590e3 (JE 3.3.82) Couldn't open file /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb: /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb (Too many open files) at com.sleepycat.collections.StoredContainer.convertException(StoredContainer.java:466) at com.sleepycat.collections.StoredContainer.getValue(StoredContainer.java:306) at com.sleepycat.collections.StoredMap.get(StoredMap.java:227) at org.archive.util.ObjectIdentityBdbCache.getOrUse(ObjectIdentityBdbCache.java:264) at org.archive.util.ObjectIdentityBdbCache.getOrUse(ObjectIdentityBdbCache.java:75) at org.archive.crawler.datamodel.ServerCache.getServerFor(ServerCache.java:100) at org.archive.crawler.datamodel.ServerCache.getServerFor(ServerCache.java:124) at org.archive.crawler.prefetch.PreconditionEnforcer.considerDnsPreconditions(PreconditionEnforcer.java:227) at org.archive.crawler.prefetch.PreconditionEnforcer.innerProcess(PreconditionEnforcer.java:111) at org.archive.crawler.framework.Processor.process(Processor.java:109) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:154) but its the date from yesterday. I think there are some file handles left open - i use the write-to-harddisk-writer. I changed the /etc/security/limits.conf regarding the FAQ but the process is still doing nothing. It is just idle, consuming memory. when i go to "reports" in the web frontend i get: An error occured java.util.NoSuchElementException java.util.NoSuchElementException at java.util.TreeMap.key(TreeMap.java:1223) at java.util.TreeMap.firstKey(TreeMap.java:284) at java.util.TreeSet.first(TreeSet.java:394) at org.archive.crawler.framework.ToePool.singleLineReportTo(ToePool.java:276) at org.archive.util.ArchiveUtils.singleLineReport(ArchiveUtils.java:728) at org.archive.crawler.framework.ToePool.singleLineReport(ToePool.java:296) at org.archive.crawler.framework.CrawlController.oneLineReportThreads(CrawlController.java:1628) at org.archive.crawler.admin.CrawlJob.getThreadOneLine(CrawlJob.java:930) at org.archive.crawler.jspc.admin.reports_jsp._jspService(Unknown Source) at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137) at javax.servlet.http.HttpServlet.service(HttpServlet.java:853) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:358) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:342) at org.archive.crawler.admin.ui.RootFilter.doFilter(RootFilter.java:67) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:334) at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:286) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567) at org.mortbay.http.HttpContext.handle(HttpContext.java:1807) at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:525) at org.mortbay.http.HttpContext.handle(HttpContext.java:1757) at org.mortbay.http.HttpServer.service(HttpServer.java:879) at org.mortbay.http.HttpConnection.service(HttpConnection.java:789) at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:960) at org.mortbay.http.HttpConnection.handle(HttpConnection.java:806) at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:218) at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:300) at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:511) When i go to "Logs" they are displayed as empty. But they aren't (last entry local-errors.log): 2010-12-11T01:20:43.747Z -2 - http://www.kzone.com.au/js/prototype.js RRLLLLRE http://www.kzone.com.au/ no-type #049 - - - le:IOException@HTTP java.io.IOException: RIS already open for ToeThread #49: http://www.kzone.com.au/js/prototype.js at org.archive.io.RecordingInputStream.open(RecordingInputStream.java:88) at org.archive.util.HttpRecorder.inputWrap(HttpRecorder.java:148) at org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:756) at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387) at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171) at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397) at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:346) at org.archive.crawler.fetcher.FetchHTTP.innerProcess(FetchHTTP.java:500) at org.archive.crawler.framework.Processor.process(Processor.java:109) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:154) I'm not sure whats going on but i think its one bug which leads to so many problems. I will test it with heritrix 3 soon.

    JIRA | 6 years ago | oliver z.
    com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.82) 0x50/0x8590e3 (JE 3.3.82) Couldn't open file /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb: /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb (Too many open files)
  4. Speed up your debug routine!

    Automated exception search integrated into your IDE

  5. 0

    I started heritrix 1.14.4 on Friday morning to crawl only text/html files, beginning with yahoo.com On Monday morning i saw that heritrix just stopped crawling, heap space looks fine. The log says (last entry): 2010-12-10T18:04:49.904Z -5 - http://kr.promotion.yahoo.com/worldcup2010/ RRLLLLLLLE http://kr.sports.yahoo.com/event/wc2010/ no-type #020 - - - err=com.sleepycat.util.RuntimeExceptionWrapper com.sleepycat.util.RuntimeExceptionWrapper: (JE 3.3.82) fetchTarget of 0x50/0x8590e3 parent IN=266461 IN class=com.sleepycat.je.tree.BIN lastFullVersion=0x77/0x6dcaa0 parent.getDirty()=true state=0 com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.82) 0x50/0x8590e3 (JE 3.3.82) Couldn't open file /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb: /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb (Too many open files) at com.sleepycat.collections.StoredContainer.convertException(StoredContainer.java:466) at com.sleepycat.collections.StoredContainer.getValue(StoredContainer.java:306) at com.sleepycat.collections.StoredMap.get(StoredMap.java:227) at org.archive.util.ObjectIdentityBdbCache.getOrUse(ObjectIdentityBdbCache.java:264) at org.archive.util.ObjectIdentityBdbCache.getOrUse(ObjectIdentityBdbCache.java:75) at org.archive.crawler.datamodel.ServerCache.getServerFor(ServerCache.java:100) at org.archive.crawler.datamodel.ServerCache.getServerFor(ServerCache.java:124) at org.archive.crawler.prefetch.PreconditionEnforcer.considerDnsPreconditions(PreconditionEnforcer.java:227) at org.archive.crawler.prefetch.PreconditionEnforcer.innerProcess(PreconditionEnforcer.java:111) at org.archive.crawler.framework.Processor.process(Processor.java:109) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:154) but its the date from yesterday. I think there are some file handles left open - i use the write-to-harddisk-writer. I changed the /etc/security/limits.conf regarding the FAQ but the process is still doing nothing. It is just idle, consuming memory. when i go to "reports" in the web frontend i get: An error occured java.util.NoSuchElementException java.util.NoSuchElementException at java.util.TreeMap.key(TreeMap.java:1223) at java.util.TreeMap.firstKey(TreeMap.java:284) at java.util.TreeSet.first(TreeSet.java:394) at org.archive.crawler.framework.ToePool.singleLineReportTo(ToePool.java:276) at org.archive.util.ArchiveUtils.singleLineReport(ArchiveUtils.java:728) at org.archive.crawler.framework.ToePool.singleLineReport(ToePool.java:296) at org.archive.crawler.framework.CrawlController.oneLineReportThreads(CrawlController.java:1628) at org.archive.crawler.admin.CrawlJob.getThreadOneLine(CrawlJob.java:930) at org.archive.crawler.jspc.admin.reports_jsp._jspService(Unknown Source) at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137) at javax.servlet.http.HttpServlet.service(HttpServlet.java:853) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:358) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:342) at org.archive.crawler.admin.ui.RootFilter.doFilter(RootFilter.java:67) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:334) at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:286) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567) at org.mortbay.http.HttpContext.handle(HttpContext.java:1807) at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:525) at org.mortbay.http.HttpContext.handle(HttpContext.java:1757) at org.mortbay.http.HttpServer.service(HttpServer.java:879) at org.mortbay.http.HttpConnection.service(HttpConnection.java:789) at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:960) at org.mortbay.http.HttpConnection.handle(HttpConnection.java:806) at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:218) at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:300) at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:511) When i go to "Logs" they are displayed as empty. But they aren't (last entry local-errors.log): 2010-12-11T01:20:43.747Z -2 - http://www.kzone.com.au/js/prototype.js RRLLLLRE http://www.kzone.com.au/ no-type #049 - - - le:IOException@HTTP java.io.IOException: RIS already open for ToeThread #49: http://www.kzone.com.au/js/prototype.js at org.archive.io.RecordingInputStream.open(RecordingInputStream.java:88) at org.archive.util.HttpRecorder.inputWrap(HttpRecorder.java:148) at org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:756) at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387) at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171) at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397) at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:346) at org.archive.crawler.fetcher.FetchHTTP.innerProcess(FetchHTTP.java:500) at org.archive.crawler.framework.Processor.process(Processor.java:109) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:154) I'm not sure whats going on but i think its one bug which leads to so many problems. I will test it with heritrix 3 soon.

    JIRA | 6 years ago | oliver z.
    com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.82) 0x50/0x8590e3 (JE 3.3.82) Couldn't open file /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb: /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb (Too many open files)
  6. 0

    Environment invalid because of previous exception: com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.87) Couldn't open file /opt/tomcat/teamsysdata/0000000a.jdb: /opt/tomcat/teamsysdata/0000000a.jdb (No such file or directory)

    YouTrack | 7 years ago
    com.sleepycat.je.RunRecoveryException: com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.87) Couldn't open file /opt/tomcat/teamsysdata/0000000a.jdb: /opt/tomcat/teamsysdata/0000000a.jdb (No such file or directory)

    Root Cause Analysis

    1. com.sleepycat.je.log.LogFileNotFoundException

      (JE 3.2.44) Couldn't open file /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb: /home/gojomo/heritrix/h2-jobs/ready-basic_seed_sites-20071009202922/state/00000000.jdb (No such file or directory)

      at com.sleepycat.je.log.FileManager.makeFileHandle()
    2. Berkeley DB Java Edition
      Environment.sync
      1. com.sleepycat.je.log.FileManager.makeFileHandle(FileManager.java:679)
      2. com.sleepycat.je.log.FileManager.access$500(FileManager.java:47)
      3. com.sleepycat.je.log.FileManager$LogEndFileDescriptor.getWritableFile(FileManager.java:1638)
      4. com.sleepycat.je.log.FileManager.writeLogBuffer(FileManager.java:908)
      5. com.sleepycat.je.log.LogBufferPool.writeBufferToFile(LogBufferPool.java:219)
      6. com.sleepycat.je.log.LogBufferPool.writeCompleted(LogBufferPool.java:275)
      7. com.sleepycat.je.log.LogManager.logInternal(LogManager.java:507)
      8. com.sleepycat.je.log.SyncedLogManager.logItem(SyncedLogManager.java:51)
      9. com.sleepycat.je.log.LogManager.log(LogManager.java:257)
      10. com.sleepycat.je.log.LogManager.logForceFlush(LogManager.java:129)
      11. com.sleepycat.je.recovery.Checkpointer.doCheckpoint(Checkpointer.java:479)
      12. com.sleepycat.je.dbi.EnvironmentImpl.invokeCheckpoint(EnvironmentImpl.java:1158)
      13. com.sleepycat.je.Environment.sync(Environment.java:859)
      13 frames
    3. org.archive.settings
      JMXSheetManagerImpl$ReapTask.run
      1. org.archive.settings.file.BdbModule.close2(BdbModule.java:570)
      2. org.archive.settings.file.BdbModule.close(BdbModule.java:541)
      3. org.archive.settings.file.FileSheetManager.offlineCleanup(FileSheetManager.java:949)
      4. org.archive.settings.jmx.JMXSheetManagerImpl.offlineCleanup(JMXSheetManagerImpl.java:555)
      5. org.archive.settings.jmx.JMXSheetManagerImpl$ReapTask.run(JMXSheetManagerImpl.java:594)
      5 frames
    4. Java RT
      TimerThread.run
      1. java.util.TimerThread.mainLoop(Timer.java:512)
      2. java.util.TimerThread.run(Timer.java:462)
      2 frames