[opencms-dev] hit.getExceprt() returns "null"
Mario Jäger
m.jaeger at alkacon.com
Fri Dec 18 13:00:00 CET 2009
Hi Christoph,
Thank you for the stack trace. I only can reproduce your problem,
when I try to index a PDF-file with any security settings. When you
open your Artikel.pdf; is there to see anything like that "(PROTECTED)"
or "(GESCHÜTZT)"? Is there any button with the label "Security settings"
or "Sicherheitseinstellungen"?
--
Kind Regards,
Mario.
-------------------
Mario Jäger
Alkacon Software GmbH - The OpenCms Experts
http://www.alkacon.com - http://www.opencms.org
Christoph P. Kukulies schrieb:
> On Fri, Dec 18, 2009 at 10:41:37AM +0100, Mario J??ger wrote:
>
>> Hi Christoph,
>>
>> Are there any entries in the OpenCms logfile WEB-INF/logs/opencms.log
>> when you rebuild the search index?
>>
>
> Ah, good idea to look there:
>
> 18 Dez 2009 12:24:43,429 INFO [earch.CmsIndexingThreadManager: 263] Indexing st
> atistics: indexed files: 61, returned threads: 61, abandoned threads: 0, duratio
> n: 00:00:06
> 18 Dez 2009 12:24:54,273 ERROR [rch.documents.A_CmsVfsDocument: 217] Extracting
> text from resource "/sites/company/download/de/new/Artikel.pdf" failed.
> org.opencms.search.CmsIndexException: Extracting text from resource "/sites/company/download/de/new/Artikel.pdf" failed.
> at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumen
> tPdf.java:91)
> at org.opencms.search.documents.A_CmsVfsDocument.createDocument(A_CmsVfs
> Document.java:210)
> at org.opencms.search.CmsIndexingThread.run(CmsIndexingThread.java:129)
> Caused by: java.io.IOException: You do not have permission to extract text
> at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:189)
> at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:140)
> at org.opencms.search.extractors.CmsExtractorPdf.extractText(CmsExtracto
> rPdf.java:104)
> at org.opencms.search.extractors.A_CmsTextExtractor.extractText(A_CmsTex
> tExtractor.java:72)
> at org.opencms.search.extractors.A_CmsTextExtractor.extractText(A_CmsTex
> tExtractor.java:62)
> at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumen
> tPdf.java:78)
> ... 2 more
> 18 Dez 2009 12:24:55,289 ERROR [rch.documents.A_CmsVfsDocument: 217] Extracting
> text from resource "/sites/company/download/de/new/Artikel1.pdf" failed.
> org.opencms.search.CmsIndexException: Extracting text from resource "/sites/company/download/de/new/Artikel1.pdf" failed.
> at org.opencms.search.documents.CmsDocumentPdf.extractContent(CmsDocumen
> tPdf.java:91)
> at org.opencms.search.documents.A_CmsVfsDocument.createDocument(A_CmsVfs
> Document.java:210)
> at org.opencms.search.CmsIndexingThread.run(CmsIndexingThread.java:129)
> Caused by: java.io.IOException: You do not have permission to extract text
>
>
> --
> Chris Christoph P. U. Kukulies kukulies (at) rwth-aachen.de
>
More information about the opencms-dev
mailing list