<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=windows-1252"
http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
Hi,<br>
<br>
this function is used while indexing MSPowerPoint file for fulltext
search.<br>
<br>
MsPowerPoint files content extractor for fulltext is rather buggy in
current <br>
OpenCms releases, beside of this OutOfMemmory problem there is national
<br>
charset problem too (some national characters are badly interpreted).<br>
<br>
My first quick patch to this bug was to check "size" value and then
skip <br>
next few rows of code if value is too big.<br>
<br>
I am trying some another implementation of MsPowerPoint extractor now<br>
based on new version of POI library, but only for OpenCms 6.2.x now. <br>
I plan to send this patched implementation to Alkacon to add it to <br>
OpenCms 7 release in near future.<br>
<br>
Quick sollution for you is to reconfigure your Fulltext search <br>
Index sources not to add MS PowerPoint files to fulltext search indexes.<br>
But these files then can't be found by fulltext search.<br>
<br>
Regards<br>
<br>
Vlastik<br>
<br>
Le Bach napsal(a):
<blockquote
cite="mid:3d8ca5df0805070219y4ed5fa70ja70a8999c0bd14f5@mail.gmail.com"
type="cite">
<div>Dear all,<br>
<br>
I am new to the OpenCMS 7, I got problem with execution of this
function (automatically called when publishing):<br>
<br>
<font face="courier new">org.opencms.search.extractors.CmsExtractorMsPowerPoint.processPOIFSReaderEvent(POIFSReaderEvent
event) {<br>
...<br>
int size = (int)LittleEndian.getUInt(buffer, i + 4) + 3;<br>
...<br>
byte[] buf = new byte[size]; <font color="#ff0000">//</font><font
color="#ff0000">produce error out of heap space here, although JVM
heap size is set to 1024MB (-Xmx 1024M)</font><br>
...</font></div>
<div><font face="courier new">}</font> <br>
<br>
The variable <font color="#ff0000">size</font> above, with some
powerpoint files, sometimes get the very big number (>=1 million),
then get OutOfMemoryError.<br>
<br>
I strongly want to know the purpose of this function, and can I change
code script to avoid OutOfMemoryError without spoiling it ?<br>
<br>
I'm looking for your idea.<br>
Any help would be appreciated.<br>
<br>
</div>
<div>-- </div>
<div>Bach Le </div>
<pre wrap="">
<hr size="4" width="90%">
_______________________________________________
This mail is sent to you from the opencms-dev mailing list
To change your list options, or to unsubscribe from the list, please visit
<a class="moz-txt-link-freetext" href="http://lists.opencms.org/mailman/listinfo/opencms-dev">http://lists.opencms.org/mailman/listinfo/opencms-dev</a></pre>
</blockquote>
<br>
<pre class="moz-signature" cols="72">--
Ing. Vlastimil Elias Qbizm technologies, a.s.
vedouci analytik/teamleader ... the art of software.
____________________________________________________________________
<a class="moz-txt-link-abbreviated" href="http://www.qbizm-technologies.cz">www.qbizm-technologies.cz</a> <a class="moz-txt-link-abbreviated" href="http://www.qbizm.cz">www.qbizm.cz</a> <a class="moz-txt-link-abbreviated" href="http://www.qbizm-services.cz">www.qbizm-services.cz</a></pre>
</body>
</html>