[opencms-dev] PDFDocument and WordDocument from Ernesto De Santis

Ernesto De Santis ernesto.desantis at colaborativa.net
Fri Nov 14 19:58:02 CET 2003


Hi.

> Yes, please do. If you don't mind, we'd really like to include it in the
> next version of the Lucene module.

Yes, of course.

Ernesto.



>
> Matt
>
> Ernesto De Santis wrote:
> > Hi Stephan
> >
> > I send the code to the list, in the past.
> >
> > do you want i send again?
> >
> > Ernesto.
> >
> >
> > ----- Original Message -----
> > From: "Hartmann, Waehrisch & Feykes GmbH" <hartmann at waehrisch-feykes.de>
> > To: <opencms-dev at opencms.org>
> > Sent: Thursday, November 13, 2003 5:14 AM
> > Subject: Re: [opencms-dev] PDFDocument and WordDocument from Ernesto De
> > Santis
> >
> >
> >
> >>Hi Valouny,
> >>
> >>i don't have the source code of Ernesto's classes.
> >>We are aware of the problem you mentioned and try to find a solution for
> >
> > the
> >
> >>next release.
> >>
> >>Bye,
> >>Stephan
> >>
> >>----- Original Message -----
> >>From: <vsouksav at csc.com.au>
> >>To: <opencms-dev at opencms.org>
> >>Sent: Thursday, November 13, 2003 1:16 AM
> >>Subject: [opencms-dev] PDFDocument and WordDocument from Ernesto De
Santis
> >>
> >>
> >>
> >>>Hi Stephan,
> >>>
> >>>Could you please make available the source code for the above class
from
> >>>Ernesto. I am also looking at indexing the content of these binary
> >
> > files.
> >
> >>>I was able to index the attributes of these files using Lucene 1.4 and
> >
> > the
> >
> >>>BodylessDocument class from your previous direction.
> >>>
> >>>Another question please, what is the quickest way to return all search
> >>>results encompassing text within the body and the attributes satisfying
> >>
> >>the
> >>
> >>>criteria. For example,
> >>>
> >>>simple_search.jsp?q=JDBC returns content that has the word "JDBC" in
the
> >>>body.
> >>>whereas
> >>>simple_search.jsp?q=title:JDBC returns content with the word "JDBC" in
> >
> > the
> >
> >>>tile.
> >>>
> >>>Thanks very much
> >>>
> >>>Regards,
> >>>Valouny Souksavat
> >>>_________________________________________________
> >>>
> >>
>
>>--------------------------------------------------------------------------
> >>--------------
> >>
> >>>This is a PRIVATE message. If you are not the intended recipient,
please
> >>>delete without copying and kindly advise us by e-mail of the mistake in
> >>>delivery. NOTE: Regardless of content, this e-mail shall not operate to
> >>>bind CSC to any order or other contract unless pursuant to explicit
> >>
> >>written
> >>
> >>>agreement or government initiative expressly permitting the use of
> >
> > e-mail
> >
> >>>for such purpose.
> >>
>
>>--------------------------------------------------------------------------
> >>--------------
> >>
> >>>
> >>>_______________________________________________
> >>>This mail is send to you from the opencms-dev mailing list
> >>>To change your list options, or to unsubscribe from the list, please
> >
> > visit
> >
> >>>http://mail.opencms.org/mailman/listinfo/opencms-dev
> >>>
> >>
> >>_______________________________________________
> >>This mail is send to you from the opencms-dev mailing list
> >>To change your list options, or to unsubscribe from the list, please
visit
> >>http://mail.opencms.org/mailman/listinfo/opencms-dev
> >>
> >>
> >
> >
> > _______________________________________________
> > This mail is send to you from the opencms-dev mailing list
> > To change your list options, or to unsubscribe from the list, please
visit
> > http://mail.opencms.org/mailman/listinfo/opencms-dev
>
>
> _______________________________________________
> This mail is send to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://mail.opencms.org/mailman/listinfo/opencms-dev
>
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PDFDocument.java
Type: application/octet-stream
Size: 1449 bytes
Desc: not available
URL: <https://webmail.opencms.org/pipermail/opencms-dev/attachments/20031114/05578167/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: WordDocument.java
Type: application/octet-stream
Size: 1529 bytes
Desc: not available
URL: <https://webmail.opencms.org/pipermail/opencms-dev/attachments/20031114/05578167/attachment-0001.obj>


More information about the opencms-dev mailing list