[opencms-dev] PDFDocument and WordDocument from Ernesto De Santis

vsouksav at csc.com.au vsouksav at csc.com.au
Thu Nov 13 01:29:01 CET 2003


Hi Stephan,

Could you please make available the source code for the above class from
Ernesto. I am also looking at indexing the content of these binary files.
I was able to index the attributes of these files using Lucene 1.4 and the
BodylessDocument class from your previous direction.

Another question please, what is the quickest way to return all search
results encompassing text within the body and the attributes satisfying the
criteria. For example,

simple_search.jsp?q=JDBC returns content that has the word "JDBC" in the
body.
whereas
simple_search.jsp?q=title:JDBC returns content with the word "JDBC" in the
tile.

Thanks very much

Regards,
Valouny Souksavat
_________________________________________________

----------------------------------------------------------------------------------------

This is a PRIVATE message. If you are not the intended recipient, please
delete without copying and kindly advise us by e-mail of the mistake in
delivery. NOTE: Regardless of content, this e-mail shall not operate to
bind CSC to any order or other contract unless pursuant to explicit written
agreement or government initiative expressly permitting the use of e-mail
for such purpose.
----------------------------------------------------------------------------------------





More information about the opencms-dev mailing list