[opencms-dev] hit.getExceprt() returns "null"
Christoph P. Kukulies
kuku at physik.rwth-aachen.de
Fri Dec 18 12:33:25 CET 2009
On Fri, Dec 18, 2009 at 10:41:37AM +0100, Mario J??ger wrote:
> Hi Christoph,
>
> Are there any entries in the OpenCms logfile WEB-INF/logs/opencms.log
> when you rebuild the search index?
Ah, good idea to look there:
18 Dez 2009 12:24:43,429 INFO [earch.CmsIndexingThreadManager: 263] Indexing st
atistics: indexed files: 61, returned threads: 61, abandoned threads: 0, duratio
n: 00:00:06
18 Dez 2009 12:24:54,273 ERROR [rch.documents.A_CmsVfsDocument: 217] Extracting
text from resource "/sites/company/download/de/new/Artikel.pdf" failed.
org.opencms.search.CmsIndexException: Extracting text from resource "/sites/company/download/de/new/Artikel.pdf" failed.
at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumen
tPdf.java:91)
at org.opencms.search.documents.A_CmsVfsDocument.createDocument(A_CmsVfs
Document.java:210)
at org.opencms.search.CmsIndexingThread.run(CmsIndexingThread.java:129)
Caused by: java.io.IOException: You do not have permission to extract text
at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:189)
at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:140)
at org.opencms.search.extractors.CmsExtractorPdf.extractText(CmsExtracto
rPdf.java:104)
at org.opencms.search.extractors.A_CmsTextExtractor.extractText(A_CmsTex
tExtractor.java:72)
at org.opencms.search.extractors.A_CmsTextExtractor.extractText(A_CmsTex
tExtractor.java:62)
at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumen
tPdf.java:78)
... 2 more
18 Dez 2009 12:24:55,289 ERROR [rch.documents.A_CmsVfsDocument: 217] Extracting
text from resource "/sites/company/download/de/new/Artikel1.pdf" failed.
org.opencms.search.CmsIndexException: Extracting text from resource "/sites/company/download/de/new/Artikel1.pdf" failed.
at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumen
tPdf.java:91)
at org.opencms.search.documents.A_CmsVfsDocument.createDocument(A_CmsVfs
Document.java:210)
at org.opencms.search.CmsIndexingThread.run(CmsIndexingThread.java:129)
Caused by: java.io.IOException: You do not have permission to extract text
--
Chris Christoph P. U. Kukulies kukulies (at) rwth-aachen.de
More information about the opencms-dev
mailing list