[opencms-dev] Cleaning paste from word document - Re-configure jTidy

M Butcher mbutcher at grcomputing.net
Wed Nov 26 17:59:01 CET 2003


Olli Aro wrote:
> Matt,
> 
> Sorry to keep bugging you again with this. Have you
> got jTidy working properly at server side? I tried to
> set parameter setXhtml(true) in the class
> com.opencms.htmlconverter, but this did not make any
> difference - I still got html after save from the
> editor. Do you know, if com.opencms.htmlconverter is
> the right place to configure jTidy or is it
> overwritten somewhere else later?

To be completely honest, I never got it to do much more than the usual 
cleanup. I was trying to use it in the Lucene module to strip out tags, 
but ultimately I was unsuccessful.

I wonder if the XHTML setting in jTidy just validates the tag structures 
(e.g. changes <br> to <br/>), but ignores thing like XML declarations.

Also, I recall looking at the newer version of jTidy and thinking that 
it had some good features, but I never got around to downloading and 
testing with OpenCms. Right now, the sourceforge site is down, so I 
can't get to the site.

Matt

> 
>  
> 
>  --- M Butcher <mbutcher at grcomputing.net> wrote: >
> Olli,
> 
>>I wasn't that impressed with htmlArea's spell
>>checker -- it requires 
>>some perl CGIs on the server. However, I have been
>>happy with the 
>>quality of HTML that htmlArea puts out -- except
>>that it often uses 
>><br/><br/> instead of <p></p> on Mozilla (haven't
>>tested IE on that bug, 
>>specifically).
>>
>>The code was a little tought to navigate, but it
>>would be possible to 
>>make significant modification to the way it works.
>>
>>And, I think it should work fine with jTidy since
>>the latter runs on the 
>>server, not the client.
>>
>>Anyway... I hope you found it helpful.
>>
>>Matt
>>
>>
>>Olli Aro wrote:
>>
>>>Thanks Matt,
>>>
>>>I seems you are right. I found an online tool
>>
>>using htmltidy
>>
>>>(http://infohound.net/tidy/). It seems the Word
>>
>>2000 setting does remove
>>
>>>some of the MS Word formatting, but not all of it.
>>>
>>>HtmlArea looks very interesting and has other
>>
>>useful features as well such
>>
>>>as a spell check facility. However, I would still
>>
>>need jtidy working as
>>
>>>well - I would like to ensure that all WYSIWYG
>>
>>produced code is in xhtml and
>>
>>>know jtidy definetely would be able to do this.
>>>
>>>Regards,
>>>
>>>Olli
> 
> 
> 
> ________________________________________________________________________
> Want to chat instantly with your online friends?  Get the FREE Yahoo!
> Messenger http://mail.messenger.yahoo.co.uk
> _______________________________________________
> This mail is send to you from the opencms-dev mailing list
> To change your list options, or to unsubscribe from the list, please visit
> http://mail.opencms.org/mailman/listinfo/opencms-dev





More information about the opencms-dev mailing list