[opencms-dev] Problems with indexing HTML files

Tran Ngoc Huy HuyTran at c-mg.net
Tue Dec 25 09:44:35 CET 2007


Hi opencms-dev,
I am having a problem with search indexing in OpenCms 6.2.3. I modified
three files in the workplace: one HTML page and two PDF files. When I
publish these files through the OpenCms API, indexing works fine for the
two PDFs, but the HTML file is reported as "... skipped", and as a result
the new text in the HTML page cannot be found by search. Publishing the
same files from the workplace gives the same result, with the HTML file
skipped. The report console output is listed below. Can you tell me what
is wrong? Am I missing something in the document type settings of the
indexing configuration?

Thank you very much!

15:16:56,989 INFO  [STDOUT] ------ Publishing resources ...
15:16:57,029 INFO  [STDOUT] ------ Publishing files ...
15:16:57,030 INFO  [STDOUT] ( 1 / 3 )
15:16:57,030 INFO  [STDOUT] Publishing file
15:16:57,031 INFO  [STDOUT]  /sites/default/publishing/index.html
15:16:57,031 INFO  [STDOUT] ...
15:16:57,443 INFO  [STDOUT] o.k.
15:16:57,446 INFO  [STDOUT] ( 2 / 3 )
15:16:57,446 INFO  [STDOUT] Publishing file
15:16:57,446 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
15:16:57,446 INFO  [STDOUT] ...
15:16:58,641 INFO  [STDOUT] o.k.
15:16:58,644 INFO  [STDOUT] ( 3 / 3 )
15:16:58,645 INFO  [STDOUT] Publishing file
15:16:58,645 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Summer_2006.pdf
15:16:58,645 INFO  [STDOUT] ...
15:16:59,516 INFO  [STDOUT] o.k.
15:16:59,518 INFO  [STDOUT] ------ ... finished publishing files
15:16:59,518 INFO  [STDOUT] Statistics: published files: 3, published folders: 0, deleted folders: 0, duration: 00:00:02
15:16:59,633 INFO  [STDOUT] ------ Updating search index "Online project (VFS)"
15:16:59,727 INFO  [STDOUT] ( 1 )
15:16:59,727 INFO  [STDOUT] Indexing file
15:16:59,727 INFO  [STDOUT]  /sites/default/publishing/index.html
15:16:59,727 INFO  [STDOUT] ...
15:16:59,728 INFO  [STDOUT] skipped
15:16:59,729 INFO  [STDOUT] ( 2 )
15:16:59,730 INFO  [STDOUT] Indexing file
15:16:59,730 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
15:16:59,730 INFO  [STDOUT] ...
15:17:01,245 INFO  [STDOUT] o.k.
15:17:01,261 INFO  [STDOUT] ( 3 )
15:17:01,261 INFO  [STDOUT] Indexing file
15:17:01,261 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Summer_2006.pdf
15:17:01,261 INFO  [STDOUT] ...
15:17:01,915 INFO  [STDOUT] o.k.
15:17:02,228 INFO  [STDOUT] ------ ... finished updating search index "Online project (VFS)"
15:17:02,234 INFO  [STDOUT] ------ Updating search index "Offline project (VFS)"
15:17:02,256 INFO  [STDOUT] ( 1 )
15:17:02,256 INFO  [STDOUT] Indexing file
15:17:02,256 INFO  [STDOUT]  /sites/default/publishing/index.html
15:17:02,256 INFO  [STDOUT] ...
15:17:02,256 INFO  [STDOUT] skipped
15:17:02,258 INFO  [STDOUT] ( 2 )
15:17:02,259 INFO  [STDOUT] Indexing file
15:17:02,259 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
15:17:02,259 INFO  [STDOUT] ...
15:17:02,276 INFO  [STDOUT] o.k.
15:17:02,278 INFO  [STDOUT] ( 3 )
15:17:02,278 INFO  [STDOUT] Indexing file
15:17:02,278 INFO  [STDOUT]  /system/galleries/download/Pensioner_newsletter/Summer_2006.pdf
15:17:02,278 INFO  [STDOUT] ...
15:17:02,311 INFO  [STDOUT] o.k.
15:17:02,422 INFO  [STDOUT] ------ ... finished updating search index "Offline project (VFS)"
15:17:02,436 INFO  [STDOUT] ------ ... the resources have been published
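
In case it helps to check a longer report for the same pattern, here is
a minimal sketch in Python. It is not part of OpenCms: the function name
skipped_by_index and the regular expressions are my own assumptions,
based only on the report format shown above. It strips the log prefix,
rejoins lines that were wrapped, and lists the files reported as
"skipped" for each search index.

```python
import re

# Log prefix as it appears in the report above, e.g.
# "15:16:59,727 INFO  [STDOUT] ". Assumed from this report only.
PREFIX = re.compile(r'^\d{2}:\d{2}:\d{2},\d{3}\s+INFO\s+\[STDOUT\]\s*')

# Either the start of an index update, or one "Indexing file" entry
# together with its final status.
EVENT = re.compile(
    r'Updating search index "(?P<index>[^"]+)"'
    r'|Indexing file (?P<path>\S+) \.\.\. (?P<status>skipped|o\.k\.)'
)


def skipped_by_index(report):
    """Return {index name: [paths reported as skipped]} for a report."""
    # Drop the prefix from every line; wrapped continuation lines have
    # no prefix and are kept unchanged.
    messages = [PREFIX.sub('', line) for line in report.splitlines()]
    # Join everything and collapse whitespace so wrapped lines re-form.
    text = ' '.join(' '.join(messages).split())

    skipped = {}
    current = None
    for match in EVENT.finditer(text):
        if match.group('index'):
            current = match.group('index')
            skipped.setdefault(current, [])
        elif match.group('status') == 'skipped' and current is not None:
            skipped[current].append(match.group('path'))
    return skipped


# A short excerpt of the report above, including wrapped lines.
SAMPLE = '''\
15:16:59,633 INFO  [STDOUT] ------ Updating search index "Online project
(VFS)"
15:16:59,727 INFO  [STDOUT] ( 1 )
15:16:59,727 INFO  [STDOUT] Indexing file
15:16:59,727 INFO  [STDOUT]  /sites/default/publishing/index.html
15:16:59,727 INFO  [STDOUT] ...
15:16:59,728 INFO  [STDOUT] skipped
15:16:59,729 INFO  [STDOUT] ( 2 )
15:16:59,730 INFO  [STDOUT] Indexing file
15:16:59,730 INFO  [STDOUT]
/system/galleries/download/Pensioner_newsletter/Autumn_2006.pdf
15:16:59,730 INFO  [STDOUT] ...
15:17:01,245 INFO  [STDOUT] o.k.
'''

if __name__ == '__main__':
    for index, paths in skipped_by_index(SAMPLE).items():
        print(index, paths)
```

This only summarises what the report already says (which files were
skipped in which index); it does not explain why OpenCms skipped them.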

-- 
Huy Tran

Senior Software Engineer

Claybourne McGregor Consulting Ltd
email:   HuyTran at c-mg.biz
web:     www.c-mg.biz
phone:   +84 4 9446921
mobile:  +84 904 338557

Add UK: Claybourne McGregor Consulting Ltd (CMG, LTD), High Trees, Hillfield Road, Hemel Hempstead, Hertfordshire HP2 4AY, UK  
Add VN: Claybourne McGregor Ltd (CMG CO., LTD), 14-16 Ham Long, Hoan Kiem district, Hanoi, Vietnam 



