[opencms-dev] Cleaning paste from word document - Re-configure jTidy

M Butcher mbutcher at grcomputing.net
Wed Nov 26 18:04:01 CET 2003


I take that back... it wasn't jTidy that had a new release. Looks like 
their last release was quite a while ago. Oops.

Matt

M Butcher wrote:
> Olli Aro wrote:
> 
>> Matt,
>>
>> Sorry to keep bugging you with this. Have you got
>> jTidy working properly on the server side? I tried
>> setting setXhtml(true) in the class
>> com.opencms.htmlconverter, but this did not make any
>> difference - I still got HTML after saving from the
>> editor. Do you know whether com.opencms.htmlconverter
>> is the right place to configure jTidy, or whether the
>> setting is overridden somewhere else later?
> 
> 
> To be completely honest, I never got it to do much more than the usual 
> cleanup. I was trying to use it in the Lucene module to strip out tags, 
> but ultimately I was unsuccessful.
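
[Editor's sketch] For the indexing case mentioned above, one crude fallback that does not involve jTidy at all is a plain regex pass over the markup. This is only a minimal sketch under stated assumptions: the class and method names (TagStripper, stripTags) are hypothetical, it is not what the Lucene module does, and it does not decode entities or skip script/style content.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of OpenCms, Lucene, or jTidy.
final class TagStripper {
    // Anything between '<' and the next '>' is treated as a tag.
    private static final Pattern TAG = Pattern.compile("<[^>]*>");
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    // Replaces each tag with a space so text from adjacent blocks
    // does not run together, then collapses the whitespace.
    static String stripTags(String html) {
        String noTags = TAG.matcher(html).replaceAll(" ");
        return WHITESPACE.matcher(noTags).replaceAll(" ").trim();
    }
}
```

With this, TagStripper.stripTags("<p>Hello <b>world</b></p>") returns "Hello world". The trade-off of substituting a space is that a tag in the middle of a word splits that word in two.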
> 
> I wonder if the XHTML setting in jTidy just validates the tag structures 
> (e.g. changes <br> to <br/>), but ignores things like XML declarations.
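
[Editor's sketch] One way to tell whether the saved editor content has at least had its tag structure fixed (the <br> versus <br/> difference) is to run it through the JDK's XML parser. This is a minimal sketch, assuming the content is a body fragment with no DOCTYPE or XML declaration. The names XhtmlCheck and isWellFormedFragment are hypothetical, and the check only tests well-formedness, not validity against an XHTML DTD.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of OpenCms or jTidy.
final class XhtmlCheck {
    // Returns true if the fragment parses as well-formed XML once it is
    // wrapped in a synthetic root element (editor output usually has
    // several top-level elements, which XML does not allow on its own).
    static boolean isWellFormedFragment(String fragment) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // DefaultHandler rethrows fatal errors without printing to stderr.
            builder.setErrorHandler(new DefaultHandler());
            String wrapped = "<root>" + fragment + "</root>";
            builder.parse(new InputSource(new StringReader(wrapped)));
            return true;
        } catch (SAXException e) {
            return false; // not well-formed, e.g. an unclosed <br>
        } catch (IOException | ParserConfigurationException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

Here isWellFormedFragment("<p>one<br>two</p>") returns false and isWellFormedFragment("<p>one<br/>two</p>") returns true. Because no DTD is loaded, named entities such as &nbsp; would also be rejected, so a false result needs a second look before blaming the tag structure.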
> 
> Also, I recall looking at the newer version of jTidy and thinking that 
> it had some good features, but I never got around to downloading and 
> testing with OpenCms. Right now, the sourceforge site is down, so I 
> can't get to the site.
> 
> Matt
> 
>>
>> --- M Butcher <mbutcher at grcomputing.net> wrote:
>>> Olli,
>>>
>>> I wasn't that impressed with htmlArea's spell checker -- it requires
>>> some Perl CGIs on the server. However, I have been happy with the
>>> quality of HTML that htmlArea puts out -- except that it often uses
>>> <br/><br/> instead of <p></p> on Mozilla (haven't tested IE on that
>>> bug, specifically).
>>>
>>> The code was a little tough to navigate, but it
>>> would be possible to make significant modifications to the way it works.
>>>
>>> And, I think it should work fine with jTidy since
>>> the latter runs on the server, not the client.
>>>
>>> Anyway... I hope you found it helpful.
>>>
>>> Matt
>>>
>>>
>>> Olli Aro wrote:
>>>
>>>> Thanks Matt,
>>>>
>>>> It seems you are right. I found an online tool using htmltidy
>>>> (http://infohound.net/tidy/). It seems the Word 2000 setting does
>>>> remove some of the MS Word formatting, but not all of it.
>>>>
>>>> HtmlArea looks very interesting and has other useful features as
>>>> well, such as a spell check facility. However, I would still need
>>>> jTidy working as well - I would like to ensure that all WYSIWYG
>>>> produced code is in XHTML, and I know jTidy would definitely be
>>>> able to do this.
>>>>
>>>> Regards,
>>>>
>>>> Olli
>>
>>
>>
>>
>> _______________________________________________
>> This mail is send to you from the opencms-dev mailing list
>> To change your list options, or to unsubscribe from the list, please 
>> visit
>> http://mail.opencms.org/mailman/listinfo/opencms-dev
> 
> 
> 




