java.io.IOException

There are no available Samebug tips for this exception. Do you have an idea how to solve this issue? A short tip would help users who saw this issue last week.

  • At least one of the exceptions seems to be caused by an RTF document attachment. I verified the document opens correctly in both Word and TextEdit so it should be valid. Sep 10, 2004 10:15:19 AM bucket.search.lucene.LuceneIndexer indexAll SEVERE: Erroring indexing object: com.atlassian.confluence.pages.Attachment@9e java.lang.NullPointerException at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534) Sep 10, 2004 10:15:19 AM bucket.search.lucene.LuceneIndexer indexAll SEVERE: Erroring indexing object: com.atlassian.confluence.pages.Attachment@aa java.lang.NullPointerException at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534) Sep 10, 2004 10:15:20 AM bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory getAttachmentStringContent SEVERE: Error extracting text from attachment: foobar.rtf java.io.IOException: Invalid header signature; read 7015536635646467195, expected -2226271756974174256 at org.apache.poi.poifs.storage.HeaderBlockReader.<init>(HeaderBlockReader.java:125) at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:120) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:32) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:104) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:53) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:12) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534) Also seeing lots of errors mentioned in CONF-1713
    via by Sulka Haro,
  • At least one of the exceptions seems to be caused by an RTF document attachment. I verified the document opens correctly in both Word and TextEdit so it should be valid. Sep 10, 2004 10:15:19 AM bucket.search.lucene.LuceneIndexer indexAll SEVERE: Erroring indexing object: com.atlassian.confluence.pages.Attachment@9e java.lang.NullPointerException at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534) Sep 10, 2004 10:15:19 AM bucket.search.lucene.LuceneIndexer indexAll SEVERE: Erroring indexing object: com.atlassian.confluence.pages.Attachment@aa java.lang.NullPointerException at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534) Sep 10, 2004 10:15:20 AM bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory getAttachmentStringContent SEVERE: Error extracting text from attachment: foobar.rtf java.io.IOException: Invalid header signature; read 7015536635646467195, expected -2226271756974174256 at org.apache.poi.poifs.storage.HeaderBlockReader.<init>(HeaderBlockReader.java:125) at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:120) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:32) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:104) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:53) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:12) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534) Also seeing lots of errors mentioned in CONF-1713
    via by Sulka Haro,
  • Document opens fine. Only one version in the attchments dir so I know I was opening the correct one. None of its text is indexed. 2005-01-20 15:54:11,423 ERROR [search.lucene.mapping.AttachmentLuceneDocumentFactory] Error extracting text from attachment: WSAD 5.0 setup.rtf java.io.IOException: Invalid header signature; read 7015536635646467195, expected -2226271756974174256 at org.apache.poi.poifs.storage.HeaderBlockReader.<init>(HeaderBlockReader.java:125) at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:120) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:48) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:107) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:56) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.getDocument(LuceneIndexer.java:156) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:94) at com.atlassian.confluence.search.lucene.ConfluenceIndexer.indexAll(ConfluenceIndexer.java:47) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:324) at org.springframework.aop.framework.AopProxyUtils.invokeJoinpointUsingReflection(AopProxyUtils.java:61) at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:149) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:116) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:56) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:152) at $Proxy22.indexEntities(Unknown Source) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534)
    via by RefuX Zanzeebarr,
  • Document opens fine. Only one version in the attchments dir so I know I was opening the correct one. None of its text is indexed. 2005-01-20 15:54:11,423 ERROR [search.lucene.mapping.AttachmentLuceneDocumentFactory] Error extracting text from attachment: WSAD 5.0 setup.rtf java.io.IOException: Invalid header signature; read 7015536635646467195, expected -2226271756974174256 at org.apache.poi.poifs.storage.HeaderBlockReader.<init>(HeaderBlockReader.java:125) at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:120) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:48) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:107) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:56) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:14) at bucket.search.lucene.LuceneIndexer.getDocument(LuceneIndexer.java:156) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:94) at com.atlassian.confluence.search.lucene.ConfluenceIndexer.indexAll(ConfluenceIndexer.java:47) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:324) at org.springframework.aop.framework.AopProxyUtils.invokeJoinpointUsingReflection(AopProxyUtils.java:61) at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:149) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:116) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:56) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:152) at $Proxy22.indexEntities(Unknown Source) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534)
    via by RefuX Zanzeebarr,
  • We have found that update the pdfbox library to the last stable version (1.2.1) solve all our current issues with pdf text extraction and improve performance. This could help people that want rely on the DSpace "out-of-box" pdf extractor without using XPDF. Below some samples of exception that go away updating the pdfbox version. Patch attached against trunk r5439 == java.io.IOException: Error: Could not find font(COSName{F1.0}) in map={} at org.pdfbox.util.operator.SetTextFont.process(SetTextFont.java:83) at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.lang.ClassCastException: org.pdfbox.cos.COSArray cannot be cast to org.pdfbox.cos.COSDictionary at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:70) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:243) at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170) at org.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:101) at org.pdfbox.cos.COSStream.getStreamTokens(COSStream.java:132) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) ==== java.io.IOException: Unknown colorspace array type:COSName{DeviceRGB} at org.pdfbox.pdmodel.graphics.color.PDColorSpaceFactory.createColorSpace(PDColorSpaceFactory.java:116) at org.pdfbox.pdmodel.PDResources.getColorSpaces(PDResources.java:264) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:193) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.lang.NullPointerException at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194) at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:182) at org.pdfbox.pdmodel.PDDocumentCatalog.getAllPages(PDDocumentCatalog.java:226) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.util.zip.ZipException: unknown compression method at java.util.zip.InflaterInputStream.read(Unknown Source) at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:97) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235) at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170) at org.pdfbox.pdfparser.PDFObjectStreamParser.<init>(PDFObjectStreamParser.java:66) at org.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:450) at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:908) at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.lang.ArrayIndexOutOfBoundsException at java.lang.System.arraycopy(Native Method) at java.io.PushbackInputStream.unread(Unknown Source) at org.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:524) at org.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:873) at org.pdfbox.pdfparser.PDFObjectStreamParser.parse(PDFObjectStreamParser.java:94) at org.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:451) at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:908) at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139) === java.io.EOFException: Unexpected end of ZLIB input stream at java.util.zip.InflaterInputStream.fill(Unknown Source) at java.util.zip.InflaterInputStream.read(Unknown Source) at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:97) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290) at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235) at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170) at org.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:101) at org.pdfbox.cos.COSStream.getStreamTokens(COSStream.java:132) at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202) at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139)
    via by Andrea Bollini,
  • problem with POI
    via by itsmenikhil itsmenikhil,
    • java.io.IOException: Invalid header signature; read 7015536635646467195, expected -2226271756974174256 at org.apache.poi.poifs.storage.HeaderBlockReader.<init>(HeaderBlockReader.java:125) at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:120) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:32) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.getAttachmentStringContent(AttachmentLuceneDocumentFactory.java:104) at bucket.search.lucene.mapping.AttachmentLuceneDocumentFactory.createDocument(AttachmentLuceneDocumentFactory.java:53) at com.atlassian.confluence.search.lucene.ConfluenceAttachmentLuceneDocumentFactory.createDocument(ConfluenceAttachmentLuceneDocumentFactory.java:12) at bucket.search.lucene.LuceneIndexer.indexSingleObject(LuceneIndexer.java:239) at bucket.search.lucene.LuceneIndexer.indexAll(LuceneIndexer.java:112) at bucket.search.lucene.AbstractBatchIndexer.indexEntities(AbstractBatchIndexer.java:57) at bucket.search.lucene.AbstractBatchIndexer$$FastClassByCGLIB$$e0fab4d1.invoke(<generated>) at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:189) at org.springframework.aop.framework.Cglib2AopProxy$MethodInvocationImpl.invokeJoinpoint(Cglib2AopProxy.java:393) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:118) at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:191) at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:138) at org.springframework.aop.framework.Cglib2AopProxy.intercept(Cglib2AopProxy.java:144) at com.atlassian.confluence.search.lucene.BatchIndexer$$EnhancerByCGLIB$$ecef86e2.indexEntities(<generated>) at com.atlassian.confluence.search.IndexingTask.run(IndexingTask.java:24) at java.lang.Thread.run(Thread.java:534)

    Users with the same issue

    Unknown visitor
    Unknown visitor1 times, last one,
    Unknown visitor
    Unknown visitor1 times, last one,