com.jaeksoft.searchlib.SearchLibException: java.io.IOException: Already open for read

opensearchserve | ale-ale | 2 years ago
  1. 0

    java.io.IOException: Already open for write

    opensearchserve | 2 years ago | ale-ale
    com.jaeksoft.searchlib.SearchLibException: java.io.IOException: Already open for read
  2. 0

    ERROR: root - The build is already running

    GitHub | 10 months ago | Constantin07
    com.jaeksoft.searchlib.SearchLibException: The build is already running
  3. 0

    Ignored URLs

    opensearchserve | 2 years ago | lynie
    com.jaeksoft.searchlib.SearchLibException: com.jaeksoft.searchlib.web.ServletException: com.jaeksoft.searchlib.SearchLibException: java.util.InvalidPropertiesFormatException: org.xml.sax.SAXParseException; lineNumber: 1; columnNumber: 1; Content is not allowed in prolog.
  4. Speed up your debug routine!

    Automated exception search integrated into your IDE

  5. 0

    Error ( The unique value is missing (url))

    opensearchserve | 4 years ago | xiaopingfeng
    com.jaeksoft.searchlib.SearchLibException: The unique value is missing (url)
  6. 0

    I started heritrix 1.14.4 on Friday morning to crawl only text/html files, beginning with yahoo.com On Monday morning i saw that heritrix just stopped crawling, heap space looks fine. The log says (last entry): 2010-12-10T18:04:49.904Z -5 - http://kr.promotion.yahoo.com/worldcup2010/ RRLLLLLLLE http://kr.sports.yahoo.com/event/wc2010/ no-type #020 - - - err=com.sleepycat.util.RuntimeExceptionWrapper com.sleepycat.util.RuntimeExceptionWrapper: (JE 3.3.82) fetchTarget of 0x50/0x8590e3 parent IN=266461 IN class=com.sleepycat.je.tree.BIN lastFullVersion=0x77/0x6dcaa0 parent.getDirty()=true state=0 com.sleepycat.je.log.LogFileNotFoundException: (JE 3.3.82) 0x50/0x8590e3 (JE 3.3.82) Couldn't open file /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb: /home/oli/Desktop/heritrix-1.14.4/jobs/nixdefault-20101210100246162/state/00000050.jdb (Too many open files) at com.sleepycat.collections.StoredContainer.convertException(StoredContainer.java:466) at com.sleepycat.collections.StoredContainer.getValue(StoredContainer.java:306) at com.sleepycat.collections.StoredMap.get(StoredMap.java:227) at org.archive.util.ObjectIdentityBdbCache.getOrUse(ObjectIdentityBdbCache.java:264) at org.archive.util.ObjectIdentityBdbCache.getOrUse(ObjectIdentityBdbCache.java:75) at org.archive.crawler.datamodel.ServerCache.getServerFor(ServerCache.java:100) at org.archive.crawler.datamodel.ServerCache.getServerFor(ServerCache.java:124) at org.archive.crawler.prefetch.PreconditionEnforcer.considerDnsPreconditions(PreconditionEnforcer.java:227) at org.archive.crawler.prefetch.PreconditionEnforcer.innerProcess(PreconditionEnforcer.java:111) at org.archive.crawler.framework.Processor.process(Processor.java:109) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:154) but its the date from yesterday. I think there are some file handles left open - i use the write-to-harddisk-writer. I changed the /etc/security/limits.conf regarding the FAQ but the process is still doing nothing. It is just idle, consuming memory. when i go to "reports" in the web frontend i get: An error occured java.util.NoSuchElementException java.util.NoSuchElementException at java.util.TreeMap.key(TreeMap.java:1223) at java.util.TreeMap.firstKey(TreeMap.java:284) at java.util.TreeSet.first(TreeSet.java:394) at org.archive.crawler.framework.ToePool.singleLineReportTo(ToePool.java:276) at org.archive.util.ArchiveUtils.singleLineReport(ArchiveUtils.java:728) at org.archive.crawler.framework.ToePool.singleLineReport(ToePool.java:296) at org.archive.crawler.framework.CrawlController.oneLineReportThreads(CrawlController.java:1628) at org.archive.crawler.admin.CrawlJob.getThreadOneLine(CrawlJob.java:930) at org.archive.crawler.jspc.admin.reports_jsp._jspService(Unknown Source) at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137) at javax.servlet.http.HttpServlet.service(HttpServlet.java:853) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:358) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:342) at org.archive.crawler.admin.ui.RootFilter.doFilter(RootFilter.java:67) at org.mortbay.jetty.servlet.WebApplicationHandler$Chain.doFilter(WebApplicationHandler.java:334) at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:286) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567) at org.mortbay.http.HttpContext.handle(HttpContext.java:1807) at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:525) at org.mortbay.http.HttpContext.handle(HttpContext.java:1757) at org.mortbay.http.HttpServer.service(HttpServer.java:879) at org.mortbay.http.HttpConnection.service(HttpConnection.java:789) at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:960) at org.mortbay.http.HttpConnection.handle(HttpConnection.java:806) at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:218) at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:300) at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:511) When i go to "Logs" they are displayed as empty. But they aren't (last entry local-errors.log): 2010-12-11T01:20:43.747Z -2 - http://www.kzone.com.au/js/prototype.js RRLLLLRE http://www.kzone.com.au/ no-type #049 - - - le:IOException@HTTP java.io.IOException: RIS already open for ToeThread #49: http://www.kzone.com.au/js/prototype.js at org.archive.io.RecordingInputStream.open(RecordingInputStream.java:88) at org.archive.util.HttpRecorder.inputWrap(HttpRecorder.java:148) at org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:756) at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387) at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171) at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397) at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:346) at org.archive.crawler.fetcher.FetchHTTP.innerProcess(FetchHTTP.java:500) at org.archive.crawler.framework.Processor.process(Processor.java:109) at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:154) I'm not sure whats going on but i think its one bug which leads to so many problems. I will test it with heritrix 3 soon.

    JIRA | 6 years ago | oliver z.
    java.io.IOException: RIS already open for ToeThread #49: http://www.kzone.com.au/js/prototype.js

    Not finding the right solution?
    Take a tour to get the most out of Samebug.

    Tired of useless tips?

    Automated exception search integrated into your IDE

    Root Cause Analysis

    1. java.io.IOException

      Already open for read

      at com.jaeksoft.searchlib.cluster.VersionFile.lock()
    2. com.jaeksoft.searchlib
      ThreadAbstract.run
      1. com.jaeksoft.searchlib.cluster.VersionFile.lock(VersionFile.java:67)
      2. com.jaeksoft.searchlib.index.IndexLucene.updateDocuments(IndexLucene.java:300)
      3. com.jaeksoft.searchlib.Client.updateDocuments(Client.java:129)
      4. com.jaeksoft.searchlib.crawler.web.database.UrlManager.updateCrawlTarget(UrlManager.java:619)
      5. com.jaeksoft.searchlib.crawler.web.database.UrlCrawlQueue.updateCrawls(UrlCrawlQueue.java:192)
      6. com.jaeksoft.searchlib.crawler.web.database.UrlCrawlQueue.indexWork(UrlCrawlQueue.java:167)
      7. com.jaeksoft.searchlib.crawler.common.process.CrawlQueueAbstract.index(CrawlQueueAbstract.java:128)
      8. com.jaeksoft.searchlib.crawler.web.process.WebCrawlMaster.runner(WebCrawlMaster.java:158)
      9. com.jaeksoft.searchlib.process.ThreadAbstract.run(ThreadAbstract.java:263)
      9 frames
    3. Java RT
      Thread.run
      1. java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
      2. java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
      3. java.lang.Thread.run(Thread.java:744)
      3 frames