java.util.NoSuchElementException

There are no available Samebug tips for this exception. Do you have an idea how to solve this issue? A short tip would help users who saw this issue last week.

  • Word Document not being indexed. There is only one copy of "Database Versioning - Howto.doc" in the attachments dir and it opens up in word just fine. However it I search in confluence for text it contains, it is not in the results. I found this error in the log files: 2005-01-20 15:54:09,188 ERROR [search.lucene.mapping.AttachmentLuceneDocumentFactory] Error extracting text from attachment: Database Versioning - Howto.doc java.util.NoSuchElementException at java.util.AbstractList$Itr.next(AbstractList.java:426) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:150) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:107) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:56) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.getDocument(LuceneIndexer.java:156) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:94) at com.atlassian.confluence.search.lucene.ConfluenceIndexer.indexAll(ConfluenceIndexer.java:47) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:324) at org.springframework.aop.framework.AopProxyUtils.invokeJoinpointUsingReflection(AopProxyUtils.java:61) at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:149) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:116) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:56) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:152) at $Proxy22.indexEntities(Unknown Source) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534)
    via by RefuX Zanzeebarr,
  • Word Document not being indexed. There is only one copy of "Database Versioning - Howto.doc" in the attachments dir and it opens up in word just fine. However it I search in confluence for text it contains, it is not in the results. I found this error in the log files: 2005-01-20 15:54:09,188 ERROR [search.lucene.mapping.AttachmentLuceneDocumentFactory] Error extracting text from attachment: Database Versioning - Howto.doc java.util.NoSuchElementException at java.util.AbstractList$Itr.next(AbstractList.java:426) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:150) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:107) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:56) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.getDocument(LuceneIndexer.java:156) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:94) at com.atlassian.confluence.search.lucene.ConfluenceIndexer.indexAll(ConfluenceIndexer.java:47) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:324) at org.springframework.aop.framework.AopProxyUtils.invokeJoinpointUsingReflection(AopProxyUtils.java:61) at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:149) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:116) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:56) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:152) at $Proxy22.indexEntities(Unknown Source) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534)
    via by RefuX Zanzeebarr,
  • We have found that update the pdfbox library to the last stable version (1.2.1) solve all our current issues with pdf text extraction and improve performance. This could help people that want rely on the DSpace "out-of-box" pdf extractor without using XPDF. Below some samples of exception that go away updating the pdfbox version. Patch attached against trunk r5439 == java.io.IOException: Error: Could not find font(COSName{F1.0}) in map={} at org.pdfbox.util.operator.SetTextFont.process(SetTextFont.java:83) at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.lang.ClassCastException: org.pdfbox.cos.COSArray cannot be cast to org.pdfbox.cos.COSDictionary at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:70) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:243) at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170) at org.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:101) at org.pdfbox.cos.COSStream.getStreamTokens(COSStream.java:132) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) ==== java.io.IOException: Unknown colorspace array type:COSName{DeviceRGB} at org.pdfbox.pdmodel.graphics.color.PDColorSpaceFactory.createColorSpace(PDColorSpaceFactory.java:116) at org.pdfbox.pdmodel.PDResources.getColorSpaces(PDResources.java:264) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:193) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.lang.NullPointerException at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194) at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:182) at org.pdfbox.pdmodel.PDDocumentCatalog.getAllPages(PDDocumentCatalog.java:226) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.util.zip.ZipException: unknown compression method at java.util.zip.InflaterInputStream.read(Unknown Source) at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:97) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235) at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170) at org.pdfbox.pdfparser.PDFObjectStreamParser.<init>(PDFObjectStreamParser.java:66) at org.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:450) at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:908) at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.lang.ArrayIndexOutOfBoundsException at java.lang.System.arraycopy(Native Method) at java.io.PushbackInputStream.unread(Unknown Source) at org.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:524) at org.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:873) at org.pdfbox.pdfparser.PDFObjectStreamParser.parse(PDFObjectStreamParser.java:94) at org.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:451) at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:908) at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.io.EOFException: Unexpected end of ZLIB input stream at java.util.zip.InflaterInputStream.fill(Unknown Source) at java.util.zip.InflaterInputStream.read(Unknown Source) at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:97) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235) at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170) at org.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:101) at org.pdfbox.cos.COSStream.getStreamTokens(COSStream.java:132) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139)
    via by Andrea Bollini,
  • NoSuchElement exception on cancelling evaluation
    via by Darin_Swanson,
  • NoSuchElementException in audit
    via GitHub by jshiell
    ,
    • java.util.NoSuchElementException at java.util.AbstractList$Itr.next(AbstractList.java:426) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:150) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:107) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:56) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.getDocument(LuceneIndexer.java:156) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:94) at com.atlassian.confluence.search.lucene.ConfluenceIndexer.indexAll(ConfluenceIndexer.java:47) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:324) at org.springframework.aop.framework.AopProxyUtils.invokeJoinpointUsingReflection(AopProxyUtils.java:61) at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:149) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:116) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:56) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:152) at $Proxy22.indexEntities(Unknown Source) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534)

    Users with the same issue

    Unknown visitor1 times, last one,