[opencms-dev] hit.getExcerpt() returns "null"

Christoph P. Kukulies kuku at physik.rwth-aachen.de
Fri Dec 18 12:33:25 CET 2009


On Fri, Dec 18, 2009 at 10:41:37AM +0100, Mario Jäger wrote:
> Hi Christoph,
>
> Are there any entries in the OpenCms logfile WEB-INF/logs/opencms.log  
> when you rebuild the search index?

Ah, good idea to look there:

18 Dez 2009 12:24:43,429  INFO [earch.CmsIndexingThreadManager: 263] Indexing statistics: indexed files: 61, returned threads: 61, abandoned threads: 0, duration: 00:00:06
18 Dez 2009 12:24:54,273 ERROR [rch.documents.A_CmsVfsDocument: 217] Extracting text from resource "/sites/company/download/de/new/Artikel.pdf" failed.
org.opencms.search.CmsIndexException: Extracting text from resource "/sites/company/download/de/new/Artikel.pdf" failed.
        at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumentPdf.java:91)
        at org.opencms.search.documents.A_CmsVfsDocument.createDocument(A_CmsVfsDocument.java:210)
        at org.opencms.search.CmsIndexingThread.run(CmsIndexingThread.java:129)
Caused by: java.io.IOException: You do not have permission to extract text
        at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:189)
        at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:140)
        at org.opencms.search.extractors.CmsExtractorPdf.extractText(CmsExtractorPdf.java:104)
        at org.opencms.search.extractors.A_CmsTextExtractor.extractText(A_CmsTextExtractor.java:72)
        at org.opencms.search.extractors.A_CmsTextExtractor.extractText(A_CmsTextExtractor.java:62)
        at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumentPdf.java:78)
        ... 2 more
18 Dez 2009 12:24:55,289 ERROR [rch.documents.A_CmsVfsDocument: 217] Extracting text from resource "/sites/company/download/de/new/Artikel1.pdf" failed.
org.opencms.search.CmsIndexException: Extracting text from resource "/sites/company/download/de/new/Artikel1.pdf" failed.
        at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumentPdf.java:91)
        at org.opencms.search.documents.A_CmsVfsDocument.createDocument(A_CmsVfsDocument.java:210)
        at org.opencms.search.CmsIndexingThread.run(CmsIndexingThread.java:129)
Caused by: java.io.IOException: You do not have permission to extract text
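
The traces above suggest that the affected PDFs are protected against text
extraction, so presumably no content ends up in the index for them. To see
which resources are affected, the log can be scanned for the "Extracting text
from resource ... failed" message. The following is only a minimal sketch using
the plain JDK: the class name FailedExtractions and the method failedResources
are made up for illustration and are not part of OpenCms, and the default log
path is simply the one mentioned above.

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of OpenCms: lists resources whose text
// extraction failed according to the log message shown in this thread.
public class FailedExtractions {

    // \s+ between the words tolerates a line wrap inside the message,
    // e.g. "Extracting\ntext from resource ...".
    private static final Pattern FAILED = Pattern.compile(
        "Extracting\\s+text\\s+from\\s+resource\\s+\"([^\"]+)\"\\s+failed");

    // Returns each failing resource path once, in order of first appearance.
    public static List<String> failedResources(String logText) {
        Set<String> paths = new LinkedHashSet<>();
        Matcher m = FAILED.matcher(logText);
        while (m.find()) {
            paths.add(m.group(1));
        }
        return new ArrayList<>(paths);
    }

    public static void main(String[] args) throws Exception {
        // Assumed default location, relative to the webapp directory.
        String file = args.length > 0 ? args[0] : "WEB-INF/logs/opencms.log";
        String log = new String(
            Files.readAllBytes(Paths.get(file)), StandardCharsets.UTF_8);
        for (String path : failedResources(log)) {
            System.out.println(path);
        }
    }
}
```

Each failure is logged twice (once in the ERROR line and once in the exception
message), which is why the paths are collected in a set. A path that is itself
broken across two lines by terminal wrapping would not be matched by this
sketch.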


--
Chris Christoph P. U. Kukulies kukulies (at) rwth-aachen.de


