[opencms-dev] Problems with indexing HTML files

Tran Ngoc Huy HuyTran at c-mg.net
Tue Dec 25 09:56:56 CET 2007


It works now that I have added org.opencms.search.documents.CmsDocumentXmlPage 
to the document type list.
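
For anyone who hits the same "skipped" message, here is a rough sketch of 
what such an entry might look like in opencms-search.xml. The element layout 
is written from memory of the 6.x search configuration, and the type name 
"xmlpage" is an assumption, so please compare it with the entries already 
in your own file (for example the one for PDF) before copying it.

```xml
<!-- Sketch only: check element names against your own opencms-search.xml -->
<documenttypes>
    <documenttype>
        <!-- "xmlpage" is an assumed name, used here for illustration -->
        <name>xmlpage</name>
        <class>org.opencms.search.documents.CmsDocumentXmlPage</class>
        <mimetypes/>
        <resourcetypes>
            <resourcetype>xmlpage</resourcetype>
        </resourcetypes>
    </documenttype>
</documenttypes>
```

As far as I understand, the index source also has to list the same name 
among its indexed document types, otherwise the resource is still skipped.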

Thank you, and sorry for the noise :).

Tran Ngoc Huy wrote:
> Hi opencms-dev,
> I am having a problem with indexing in OpenCms 6.2.3. I modified 3 
> files in the workplace: 1 is an HTML page and the other 2 are PDF files. 
> When I publish these files using the OpenCms API, indexing works fine 
> for the 2 PDFs, but the HTML file is reported as "... skipped", and as a 
> result the new text in the HTML cannot be searched. Publishing the same 
> files from the workplace gives the same result, with the HTML file 
> skipped. I have listed the report console output below. Can you tell me 
> what is wrong? Am I missing anything in the document types of the 
> indexing configuration?
>
> Thank you very much!
>
> 15:16:56,989 INFO  [STDOUT] ------ Publishing resources ...
> 15:16:57,029 INFO  [STDOUT] ------ Publishing files ...
> 15:16:57,030 INFO  [STDOUT] ( 1 / 3 )
> 15:16:57,030 INFO  [STDOUT] Publishing file
> 15:16:57,031 INFO  [STDOUT]  /sites/default/publishing/index.html
> 15:16:57,031 INFO  [STDOUT] ...
> 15:16:57,443 INFO  [STDOUT] o.k.
> 15:16:57,446 INFO  [STDOUT] ( 2 / 3 )
> 15:16:57,446 INFO  [STDOUT] Publishing file
> 15:16:57,446 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
> 15:16:57,446 INFO  [STDOUT] ...
> 15:16:58,641 INFO  [STDOUT] o.k.
> 15:16:58,644 INFO  [STDOUT] ( 3 / 3 )
> 15:16:58,645 INFO  [STDOUT] Publishing file
> 15:16:58,645 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Summer_2006.pdf
> 15:16:58,645 INFO  [STDOUT] ...
> 15:16:59,516 INFO  [STDOUT] o.k.
> 15:16:59,518 INFO  [STDOUT] ------ ... finished publishing files
> 15:16:59,518 INFO  [STDOUT] Statistics: published files: 3, published folders: 0, deleted folders: 0, duration: 00:00:02
> 15:16:59,633 INFO  [STDOUT] ------ Updating search index "Online project (VFS)"
> 15:16:59,727 INFO  [STDOUT] ( 1 )
> 15:16:59,727 INFO  [STDOUT] Indexing file
> 15:16:59,727 INFO  [STDOUT]  /sites/default/publishing/index.html
> 15:16:59,727 INFO  [STDOUT] ...
> 15:16:59,728 INFO  [STDOUT] skipped
> 15:16:59,729 INFO  [STDOUT] ( 2 )
> 15:16:59,730 INFO  [STDOUT] Indexing file
> 15:16:59,730 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
> 15:16:59,730 INFO  [STDOUT] ...
> 15:17:01,245 INFO  [STDOUT] o.k.
> 15:17:01,261 INFO  [STDOUT] ( 3 )
> 15:17:01,261 INFO  [STDOUT] Indexing file
> 15:17:01,261 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Summer_2006.pdf
> 15:17:01,261 INFO  [STDOUT] ...
> 15:17:01,915 INFO  [STDOUT] o.k.
> 15:17:02,228 INFO  [STDOUT] ------ ... finished updating search index "Online project (VFS)"
> 15:17:02,234 INFO  [STDOUT] ------ Updating search index "Offline project (VFS)"
> 15:17:02,256 INFO  [STDOUT] ( 1 )
> 15:17:02,256 INFO  [STDOUT] Indexing file
> 15:17:02,256 INFO  [STDOUT]  /sites/default/publishing/index.html
> 15:17:02,256 INFO  [STDOUT] ...
> 15:17:02,256 INFO  [STDOUT] skipped
> 15:17:02,258 INFO  [STDOUT] ( 2 )
> 15:17:02,259 INFO  [STDOUT] Indexing file
> 15:17:02,259 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
> 15:17:02,259 INFO  [STDOUT] ...
> 15:17:02,276 INFO  [STDOUT] o.k.
> 15:17:02,278 INFO  [STDOUT] ( 3 )
> 15:17:02,278 INFO  [STDOUT] Indexing file
> 15:17:02,278 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Summer_2006.pdf
> 15:17:02,278 INFO  [STDOUT] ...
> 15:17:02,311 INFO  [STDOUT] o.k.
> 15:17:02,422 INFO  [STDOUT] ------ ... finished updating search index "Offline project (VFS)"
> 15:17:02,436 INFO  [STDOUT] ------ ... the resources have been published
>
>   

-- 
Huy Tran

Senior Software Engineer

Claybourne McGregor Consulting Ltd
email:   HuyTran at c-mg.biz
web:     www.c-mg.biz
phone:   +84 4 9446921
mobile:  +84 904 338557

Add UK: Claybourne McGregor Consulting Ltd (CMG, LTD), High Trees, Hillfield Road, Hemel Hempstead, Hertfordshire HP2 4AY, UK  
Add VN: Claybourne McGregor Ltd (CMG CO., LTD), 14-16 Ham Long, Hoan Kiem district, Hanoi, Vietnam 