{"id":31002,"date":"2024-02-16T17:08:34","date_gmt":"2024-02-16T16:08:34","guid":{"rendered":"https:\/\/www.dbi-services.com\/blog\/?p=31002"},"modified":"2026-02-25T21:27:56","modified_gmt":"2026-02-25T20:27:56","slug":"alfresco-a-never-ending-transformation","status":"publish","type":"post","link":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/","title":{"rendered":"Alfresco &#8211; A never ending transformation"},"content":{"rendered":"\n<p>Beginning of the week, as I was working for our ServiceDesk (SLA support for our customers), I saw a few dozen mails generated by our monitoring over the weekend on a Production Alfresco 7.x Cluster doing the yo-yo in terms of RAM and Disk Space. Nothing was down, just some strange behavior where 20GB of free space would be gone and then re-appear after a few minutes and same thing for the RAM\/SWAP.<\/p>\n\n\n\n<p>The first thing I checked was the disk space mentioned on the alert. We received alerts from all members of the cluster one by one, almost in a perfect round-robin manner. On the second node, I saw the issue occurring in real-time, so I looked into what exactly was generating all the noise:<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: bash; highlight: [4,9,12,15,18]; title: ; notranslate\" title=\"\">\nalfresco@alf-p2:~# date; df -h \/tmp\nMon Feb 12 07:27:41 UTC 2024\nFilesystem      Size  Used Avail Use% Mounted on\n\/dev\/sdb2        19G    7G   12G  35% \/tmp\nalfresco@alf-p2:~#\nalfresco@alf-p2:~# date; df -h \/tmp\nMon Feb 12 07:28:20 UTC 2024\nFilesystem      Size  Used Avail Use% Mounted on\n\/dev\/sdb2        19G    9G    9G  49% \/tmp\nalfresco@alf-p2:~#\nalfresco@alf-p2:~# du -sm \/tmp\/\n9427    \/tmp\/\nalfresco@alf-p2:~#\nalfresco@alf-p2:~# du -sm \/tmp\/\n9484    \/tmp\/\nalfresco@alf-p2:~#\nalfresco@alf-p2:~# du -sm \/tmp\/\n9541    \/tmp\/\nalfresco@alf-p2:~#\n<\/pre><\/div>\n\n\n<p>In less than a minute, around 2\/3Gb of temporary files were generated, which doesn&#8217;t seem very healthy:<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: bash; highlight: [15,16,24]; title: ; notranslate\" title=\"\">\nalfresco@alf-p2:~# cd \/tmp\nalfresco@alf-p2:\/tmp#\nalfresco@alf-p2:\/tmp# ls -ltr\ntotal 480\n...\n-rw-r-----   1 alfresco  alfresco    115 Feb 11 21:26 scheduler.json\ndrwxr-x---   2 alfresco  alfresco   4096 Feb 12 07:28 Alfresco\/\ndrwxrwxrwt 117 root      root      12288 Feb 12 07:28 .\/\nalfresco@alf-p2:\/tmp#\nalfresco@alf-p2:\/tmp# cd Alfresco\/\nalfresco@alf-p2:\/tmp\/Alfresco# ls -l\ntotal 10553428\ndrwxr-x---   2 alfresco alfresco        4096 Feb 12 07:29 .\/\ndrwxrwxrwt 117 root     root           12288 Feb 12 07:29 ..\/\n-rw-r-----   1 alfresco alfresco     1897650 Feb 12 07:23 source_11877384286747332767_tmp.pdf\n-rw-r-----   1 alfresco alfresco 10804789248 Feb 12 07:29 target_18121744399232974935_tmp.txt\nalfresco@alf-p2:\/tmp\/Alfresco#\nalfresco@alf-p2:\/tmp\/Alfresco#\nalfresco@alf-p2:\/tmp\/Alfresco# ls -l\ntotal 10686460\ndrwxr-x---   2 alfresco alfresco        4096 Feb 12 07:29 .\/\ndrwxrwxrwt 117 root     root           12288 Feb 12 07:29 ..\/\n-rw-r-----   1 alfresco alfresco     1897650 Feb 12 07:23 source_11877384286747332767_tmp.pdf\n-rw-r-----   1 alfresco alfresco 10941014016 Feb 12 07:29 target_18121744399232974935_tmp.txt\nalfresco@alf-p2:\/tmp\/Alfresco#\n<\/pre><\/div>\n\n\n<p>At that point in time, it looked like Alfresco was doing something that was causing the issue for the Disk Space, at least. Here, we can see a PDF file that is a &#8220;source&#8221; and a TXT file that appears to be under generation, as a &#8220;target&#8221;. So of course, my first thought here is that this is probably the Alfresco Transformation Service that is causing this issue, trying to transform a PDF into TXT, most probably for indexing of the content of this file.<\/p>\n\n\n\n<p>While looking at the RAM\/SWAP usage on this server, it was also showing the same thing, with the Java process of the ATS using 100% CPU (fortunately, the host has multiple CPUs) and going overboard with its RAM, forcing the host to SWAP.<\/p>\n\n\n\n<p>Therefore, I looked at the ATS logs and saw 2 types of errors. First was a few IOException on PDFBox &#8220;<strong><em>Error: End-Of-File: expected line<\/em><\/strong>&#8221; but there wasn&#8217;t a lot of those\u2026 Then there was another error, much more present, that was the consequence of the FileSystem being full:<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: java; highlight: [21,29,39]; title: ; notranslate\" title=\"\">\nalfresco@alf-p2:~# cat $ATS_HOME\/logs\/transform-core-aio.log\n...\n2024-02-12 07:18:37.380 ERROR 23713 --- &#x5B;o-8090-exec-141] o.a.transformer.TransformController      : Error writing: Seite 1\n\norg.alfresco.transform.exceptions.TransformException: Error writing: Seite 1\n        at org.alfresco.transformer.executors.Transformer.transform(Transformer.java:83) ~&#x5B;alfresco-transformer-base-2.5.3.jar!\/:2.5.3]\n        at org.alfresco.transformer.AIOController.transformImpl(AIOController.java:118) ~&#x5B;classes!\/:2.5.3]\n        at org.alfresco.transformer.AbstractTransformerController.transform(AbstractTransformerController.java:173) ~&#x5B;alfresco-transformer-base-2.5.3.jar!\/:2.5.3]\n        at jdk.internal.reflect.GeneratedMethodAccessor75.invoke(Unknown Source) ~&#x5B;na:na]\n        at java.base\/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~&#x5B;na:na]\n        at java.base\/java.lang.reflect.Method.invoke(Method.java:566) ~&#x5B;na:na]\n        ...\nCaused by: java.lang.IllegalStateException: Error writing: Seite 1\n        at org.alfresco.transformer.executors.Tika.transform(Tika.java:697) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n        at org.alfresco.transformer.executors.Tika.transform(Tika.java:673) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n        at org.alfresco.transformer.executors.Tika.transform(Tika.java:617) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n        at org.alfresco.transformer.executors.TikaJavaExecutor.call(TikaJavaExecutor.java:141) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n        at org.alfresco.transformer.executors.TikaJavaExecutor.transform(TikaJavaExecutor.java:131) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n        at org.alfresco.transformer.executors.Transformer.transform(Transformer.java:70) ~&#x5B;alfresco-transformer-base-2.5.3.jar!\/:2.5.3]\n        ... 55 common frames omitted\nCaused by: org.xml.sax.SAXException: Error writing: Seite 1\n        at org.apache.tika.sax.ToTextContentHandler.characters(ToTextContentHandler.java:110) ~&#x5B;tika-core-1.26.jar!\/:1.26]\n        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) ~&#x5B;tika-core-1.26.jar!\/:1.26]\n        at org.apache.tika.sax.WriteOutContentHandler.characters(WriteOutContentHandler.java:136) ~&#x5B;tika-core-1.26.jar!\/:1.26]\n        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) ~&#x5B;tika-core-1.26.jar!\/:1.26]\n        ...\n        at org.alfresco.transformer.executors.Tika.transform(Tika.java:693) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n        ... 60 common frames omitted\n        Suppressed: java.io.IOException: No space left on device\n                at java.base\/java.io.FileOutputStream.writeBytes(Native Method) ~&#x5B;na:na]\n                at java.base\/java.io.FileOutputStream.write(FileOutputStream.java:354) ~&#x5B;na:na]\n                at java.base\/sun.nio.cs.StreamEncoder.writeBytes(StreamEncoder.java:233) ~&#x5B;na:na]\n                at java.base\/sun.nio.cs.StreamEncoder.implClose(StreamEncoder.java:337) ~&#x5B;na:na]\n                at java.base\/sun.nio.cs.StreamEncoder.close(StreamEncoder.java:161) ~&#x5B;na:na]\n                at java.base\/java.io.OutputStreamWriter.close(OutputStreamWriter.java:255) ~&#x5B;na:na]\n                at java.base\/java.io.BufferedWriter.close(BufferedWriter.java:269) ~&#x5B;na:na]\n                at org.alfresco.transformer.executors.Tika.transform(Tika.java:684) ~&#x5B;alfresco-transform-tika-2.5.3.jar!\/:2.5.3]\n                ... 60 common frames omitted\nCaused by: java.io.IOException: No space left on device\n        at java.base\/java.io.FileOutputStream.writeBytes(Native Method) ~&#x5B;na:na]\n        at java.base\/java.io.FileOutputStream.write(FileOutputStream.java:354) ~&#x5B;na:na]\n        at java.base\/sun.nio.cs.StreamEncoder.writeBytes(StreamEncoder.java:233) ~&#x5B;na:na]\n        at java.base\/sun.nio.cs.StreamEncoder.implWrite(StreamEncoder.java:303) ~&#x5B;na:na]\n...\nalfresco@alf-p2:~#\n<\/pre><\/div>\n\n\n<p>As you can see above, at 07:18, the FileSystem \/tmp was 100% full and when I checked 5 minutes later, at 07:23, a new transformation was already producing a 10Gb text file and still growing. So, it was clear that this happens repeatedly, most probably for the same document. According to the monitoring, the issue started just before the weekend. Looking at the first occurrences of the FileSystem full from the ATS logs gave the following:<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: bash; title: ; notranslate\" title=\"\">\nalfresco@alf-p2:~# grep &#039;2024.*Error writing&#039; $ATS_HOME\/logs\/transform-core-aio.log\n2024-02-09 19:20:51.628 ERROR 23713 --- &#x5B;o-8090-exec-166] o.a.transformer.TransformController      : Error writing:\n2024-02-09 19:41:29.954 ERROR 23713 --- &#x5B;o-8090-exec-156] o.a.transformer.TransformController      : Error writing: Seite 1\n2024-02-09 20:02:11.764 ERROR 23713 --- &#x5B;o-8090-exec-160] o.a.transformer.TransformController      : Error writing: Seite 1\n2024-02-09 20:23:08.828 ERROR 23713 --- &#x5B;o-8090-exec-163] o.a.transformer.TransformController      : Error writing:\n2024-02-09 20:44:05.313 ERROR 23713 --- &#x5B;o-8090-exec-141] o.a.transformer.TransformController      : Error writing: Seite 1\n2024-02-09 21:04:52.642 ERROR 23713 --- &#x5B;o-8090-exec-162] o.a.transformer.TransformController      : Error writing: Seite 1\n...\n2024-02-12 07:18:37.380 ERROR 23713 --- &#x5B;o-8090-exec-152] o.a.transformer.TransformController      : Error writing: Seite 1\nalfresco@alf-p2:~#\n<\/pre><\/div>\n\n\n<p>With the above, it pretty much confirms that it&#8217;s the same document that is always failing, since it&#8217;s blocking on &#8220;<strong><em>Seite 1<\/em><\/strong>&#8220;, which means &#8220;<strong><em>Page 1<\/em><\/strong>&#8221; in English.<\/p>\n\n\n\n<p>To be able to find which document is causing the issue in Alfresco, there isn&#8217;t a lot of details available, since the ATS isn&#8217;t really giving you much about what it is doing. All I had was a <em>temporary name<\/em> (which obviously doesn&#8217;t trace back to anything in the Repository) and a <em>size<\/em>. Therefore, I checked for documents on the Alfresco Data (&#8220;<strong><em>alf_data<\/em><\/strong>&#8220;) with a size equal to the document &#8220;<strong><em>\/tmp\/Alfresco\/source_11877384286747332767_tmp.pdf<\/em><\/strong>&#8221; (i.e. 1897650 bytes), created in the last few days. I expected it to be created on the 9-Feb, a little before 19:20 and I indeed found one:<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: bash; highlight: [5,6]; title: ; notranslate\" title=\"\">\nalfresco@alf-p2:~# find \/alf_data\/contentstore\/2024\/2\/ -type f -ls | grep 1897650\n 34508512  1856 -rw-r----- 1 alfresco alfresco 1897650 Feb 9 19:02 \/alf_data\/contentstore\/2024\/2\/9\/19\/02\/174f569e-93a3-4829-8ad5-bd3d6e78447b.bin\nalfresco@alf-p2:~#\nalfresco@alf-p2:~# md5sum \/tmp\/Alfresco\/source_11877384286747332767_tmp.pdf \/alf_data\/contentstore\/2024\/2\/9\/19\/02\/174f569e-93a3-4829-8ad5-bd3d6e78447b.bin\n45ed40bd5f84b7c68e246885f2b6a55f  \/tmp\/Alfresco\/source_11877384286747332767_tmp.pdf\n45ed40bd5f84b7c68e246885f2b6a55f  \/alf_data\/contentstore\/2024\/2\/9\/19\/02\/174f569e-93a3-4829-8ad5-bd3d6e78447b.bin\nalfresco@alf-p2:~#\nalfresco@alf-p2:~# diff \/tmp\/Alfresco\/source_11877384286747332767_tmp.pdf \/alf_data\/contentstore\/2024\/2\/9\/19\/02\/174f569e-93a3-4829-8ad5-bd3d6e78447b.bin\nalfresco@alf-p2:~#\n<\/pre><\/div>\n\n\n<p>Therefore, this is the same content file. There is of course the possibility that a duplicate node was using the same content before February (as I searched only inside \/2024\/2, that means February), but since the issue appeared only over the weekend, it&#8217;s pretty safe to assume it&#8217;s this document\/node.<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: bash; highlight: [2,6]; title: ; notranslate\" title=\"\">\nalfresco@alf-p2:~# stat \/alf_data\/contentstore\/2024\/2\/9\/19\/02\/174f569e-93a3-4829-8ad5-bd3d6e78447b.bin\n  File: \/alf_data\/contentstore\/2024\/2\/9\/19\/02\/174f569e-93a3-4829-8ad5-bd3d6e78447b.bin\n  Size: 1897650         Blocks: 3712       IO Block: 262144 regular file\nDevice: 34h\/52d Inode: 34508512    Links: 1\nAccess: (0640\/-rw-r-----)  Uid: (  113\/alfresco)   Gid: (  116\/alfresco)\nAccess: 2024-02-09 19:02:12.153002964 +0000\nModify: 2024-02-09 19:02:12.157983495 +0000\nChange: 2024-02-09 19:02:12.157983635 +0000\n Birth: -\nalfresco@alf-p2:~#\n<\/pre><\/div>\n\n\n<p>From that point, I had the &#8220;<strong><em>content_url<\/em><\/strong>&#8221; of a Node. Therefore, I could have used the Database (see <a href=\"https:\/\/www.dbi-services.com\/blog\/alfresco-some-useful-database-queries\/\" target=\"_blank\" rel=\"noreferrer noopener\">useful database queries<\/a>) to find the NodeRef of this Alfresco Node but at this customer, I don&#8217;t have an easy access to the DB, so I went through Share instead.<\/p>\n\n\n\n<p>I know the node was created (or modified) at 19:02:12 (+\/- 1s) on the 9-Feb, and even if the content isn&#8217;t indexed, its metadata should still be available searchable. Therefore, I just performed a search on Alfresco Share, to find documents created (or modified) at that exact time, i.e. <strong><em>cm:created:&#8217;2024-02-09T19:02:12&#8242;<\/em><\/strong>.<\/p>\n\n\n\n<p>That gave me 4 results, out of which only 1 had a size around 2MB. To validate if this was indeed the document causing the issue, I simply used the <a href=\"https:\/\/github.com\/OrderOfTheBee\/ootbee-support-tools\" target=\"_blank\" rel=\"noreferrer noopener\">JavaScript Console<\/a> to dump this file and it gave me the exact same &#8220;<strong><em>content_url<\/em><\/strong>&#8220;. I could also validate on Share that this specific file wasn&#8217;t content-indexed yet (despite being in the repository for 2.5 days).<\/p>\n\n\n\n<p>As a temporary workaround, to stop the OS from going crazy, I set this document as metadata-indexed only (no content), using the &#8220;<em>Index Control<\/em>&#8221; aspect. If you don&#8217;t know how this works, it&#8217;s pretty simple for a node:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Click on &#8220;Manage Aspect&#8221;<\/li>\n\n\n\n<li>From the list of &#8220;Available to Add&#8221;, find &#8220;Index Control (cm:indexControl)&#8221;<\/li>\n\n\n\n<li>Click on &#8220;+&#8221; to add it to the list of &#8220;Currently Selected&#8221;<\/li>\n\n\n\n<li>Click on &#8220;Apply changes&#8221;<\/li>\n\n\n\n<li>Click on &#8220;Edit Properties&#8221;<\/li>\n\n\n\n<li>Uncheck the &#8220;Is Content Indexed&#8221; option<\/li>\n<\/ul>\n\n\n\n<p>After doing that, you should be able to see something like that on the node&#8217;s properties:<\/p>\n\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69eeca16d7815&quot;}\" data-wp-interactive=\"core\/image\" data-wp-key=\"69eeca16d7815\" class=\"wp-block-image size-full wp-lightbox-container\"><img loading=\"lazy\" decoding=\"async\" width=\"826\" height=\"188\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on--click=\"actions.showLightbox\" data-wp-on--load=\"callbacks.setButtonStyles\" data-wp-on-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png\" alt=\"Alfresco Index Control\" class=\"wp-image-31005\" srcset=\"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png 826w, https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl-300x68.png 300w, https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl-768x175.png 768w\" sizes=\"auto, (max-width: 826px) 100vw, 826px\" \/><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<p>In case a transformation for this document is already in progress, you will need to wait for the FileSystem to be full for the ATS (java) to remove its temporary file and realize that this document doesn&#8217;t need to be transformed anymore. You can probably also restart the process, if you prefer.<\/p>\n\n\n\n<p>That&#8217;s only a workaround of course, not a real solution. Therefore, even if I knew that the issue was most probably around &#8220;<strong><em>Seite 1<\/em><\/strong>&#8220;, I replicated the issue on TEST by uploading this same file into the TEST environment and then looked inside the TXT content, to validate that assumption:<\/p>\n\n\n<div class=\"wp-block-syntaxhighlighter-code \"><pre class=\"brush: bash; highlight: [9,12,17]; title: ; notranslate\" title=\"\">\nalfresco@alf-t1:\/tmp\/Alfresco# ls -l\ntotal 23960\ndrwxr-x---  2 alfresco alfresco      4096 Feb 12 09:10 .\/\ndrwxrwxrwt 25 root     root         36864 Feb 12 09:10 ..\/\n-rw-r-----  1 alfresco alfresco   1897650 Feb 12 09:10 source_2995534351432950419_tmp.pdf\n-rw-r-----  1 alfresco alfresco  22593536 Feb 12 09:10 target_7429882841367188802_tmp.txt\nalfresco@alf-t1:\/tmp\/Alfresco#\nalfresco@alf-t1:\/tmp\/Alfresco# wc -l target_7429882841367188802_tmp.txt\n2509490 target_7429882841367188802_tmp.txt\nalfresco@alf-t1:\/tmp\/Alfresco#\nalfresco@alf-t1:\/tmp\/Alfresco# grep -v &quot;^&#x5B;&#x5B;:space:]]*Seite 1$&quot; target_7429882841367188802_tmp.txt | wc -l\n1913\nalfresco@alf-t1:\/tmp\/Alfresco#\nalfresco@alf-t1:\/tmp\/Alfresco# sleep 30\nalfresco@alf-t1:\/tmp\/Alfresco#\nalfresco@alf-t1:\/tmp\/Alfresco# wc -l target_7429882841367188802_tmp.txt\n83418233 target_7429882841367188802_tmp.txt\nalfresco@alf-t1:\/tmp\/Alfresco#\nalfresco@alf-t1:\/tmp\/Alfresco# grep -v &quot;^&#x5B;&#x5B;:space:]]*Seite 1$&quot; target_7429882841367188802_tmp.txt | wc -l\n1913\nalfresco@alf-t1:\/tmp\/Alfresco#\n<\/pre><\/div>\n\n\n<p>As shown above, there are 1913 lines of some texts and then the rest of the millions of lines are exactly &#8220;<strong><em>    Seite 1<\/em><\/strong>&#8220;. This text is actually coming from the page 34 of the PDF (it&#8217;s a merge of multiple PDFs it seems). By removing the page 34 from the document, it can be indexed properly. In the end, the &#8220;quick&#8221; solution for this customer is to fix the PDF (e.g. transform the page 34 into an image, then back into a PDF and OCRize it so it is indexed and searchable).<\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Beginning of the week, as I was working for our ServiceDesk (SLA support for our customers), I saw a few dozen mails generated by our monitoring over the weekend on a Production Alfresco 7.x Cluster doing the yo-yo in terms of RAM and Disk Space. Nothing was down, just some strange behavior where 20GB of [&hellip;]<\/p>\n","protected":false},"author":20,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[525],"tags":[3261,3169,3259,2766,3260],"type_dbi":[3262,3266,3265,3263,3264],"class_list":["post-31002","post","type-post","status-publish","format-standard","hentry","category-enterprise-content-management","tag-aio","tag-alfresco","tag-ats","tag-indexing","tag-transform","type-aio","type-alfresco","type-ats","type-indexing","type-transform"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.2 (Yoast SEO v27.4) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Alfresco - A never ending transformation - dbi Blog<\/title>\n<meta name=\"description\" content=\"An infinite transformation from the Alfresco Transform Service for text indexing is using all disk space and RAM?\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Alfresco - A never ending transformation\" \/>\n<meta property=\"og:description\" content=\"An infinite transformation from the Alfresco Transform Service for text indexing is using all disk space and RAM?\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/\" \/>\n<meta property=\"og:site_name\" content=\"dbi Blog\" \/>\n<meta property=\"article:published_time\" content=\"2024-02-16T16:08:34+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-25T20:27:56+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png\" \/>\n\t<meta property=\"og:image:width\" content=\"826\" \/>\n\t<meta property=\"og:image:height\" content=\"188\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Morgan Patou\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@MorganPatou\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Morgan Patou\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/\"},\"author\":{\"name\":\"Morgan Patou\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/#\\\/schema\\\/person\\\/c4d05b25843a9bc2ab20415dae6bd2d8\"},\"headline\":\"Alfresco &#8211; A never ending transformation\",\"datePublished\":\"2024-02-16T16:08:34+00:00\",\"dateModified\":\"2026-02-25T20:27:56+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/\"},\"wordCount\":1043,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/indexControl.png\",\"keywords\":[\"AIO\",\"Alfresco\",\"ATS\",\"indexing\",\"Transform\"],\"articleSection\":[\"Enterprise content management\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/\",\"url\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/\",\"name\":\"Alfresco - A never ending transformation - dbi Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/indexControl.png\",\"datePublished\":\"2024-02-16T16:08:34+00:00\",\"dateModified\":\"2026-02-25T20:27:56+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/#\\\/schema\\\/person\\\/c4d05b25843a9bc2ab20415dae6bd2d8\"},\"description\":\"An infinite transformation from the Alfresco Transform Service for text indexing is using all disk space and RAM?\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/indexControl.png\",\"contentUrl\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/indexControl.png\",\"width\":826,\"height\":188},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/alfresco-a-never-ending-transformation\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Alfresco &#8211; A never ending transformation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/\",\"name\":\"dbi Blog\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/#\\\/schema\\\/person\\\/c4d05b25843a9bc2ab20415dae6bd2d8\",\"name\":\"Morgan Patou\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5d7f5bec8b597db68a09107a6f5309e3870d6296ef94fb10ead4b09454ca67e5?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5d7f5bec8b597db68a09107a6f5309e3870d6296ef94fb10ead4b09454ca67e5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5d7f5bec8b597db68a09107a6f5309e3870d6296ef94fb10ead4b09454ca67e5?s=96&d=mm&r=g\",\"caption\":\"Morgan Patou\"},\"description\":\"Morgan Patou has over 12 years of experience in Enterprise Content Management (ECM) systems, with a strong focus in recent years on platforms such as Alfresco, Documentum, and M-Files. He specializes in the architecture, setup, customization, and maintenance of ECM infrastructures in complex &amp; critical environments. Morgan is well-versed in both engineering and operations aspects, including high availability design, system integration, and lifecycle management. He also has a solid foundation in open-source and proprietary technologies - ranging from Apache, OpenLDAP or Kerberos to enterprise-grade systems like WebLogic. Morgan Patou holds an Engineering Degree in Computer Science from ENSISA (\u00c9cole Nationale Sup\u00e9rieure d'Ing\u00e9nieurs Sud Alsace) in Mulhouse, France. He is Alfresco Content Services Certified Administrator (ACSCA), Alfresco Content Services Certified Engineer (ACSCE) as well as OpenText Documentum Certified Administrator. His industry experience spans the Public Sector, IT Services, Financial Services\\\/Banking, and the Pharmaceutical industry.\",\"sameAs\":[\"https:\\\/\\\/blog.dbi-services.com\\\/author\\\/morgan-patou\\\/\",\"https:\\\/\\\/x.com\\\/MorganPatou\"],\"url\":\"https:\\\/\\\/www.dbi-services.com\\\/blog\\\/author\\\/morgan-patou\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Alfresco - A never ending transformation - dbi Blog","description":"An infinite transformation from the Alfresco Transform Service for text indexing is using all disk space and RAM?","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/","og_locale":"en_US","og_type":"article","og_title":"Alfresco - A never ending transformation","og_description":"An infinite transformation from the Alfresco Transform Service for text indexing is using all disk space and RAM?","og_url":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/","og_site_name":"dbi Blog","article_published_time":"2024-02-16T16:08:34+00:00","article_modified_time":"2026-02-25T20:27:56+00:00","og_image":[{"width":826,"height":188,"url":"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png","type":"image\/png"}],"author":"Morgan Patou","twitter_card":"summary_large_image","twitter_creator":"@MorganPatou","twitter_misc":{"Written by":"Morgan Patou","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#article","isPartOf":{"@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/"},"author":{"name":"Morgan Patou","@id":"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/c4d05b25843a9bc2ab20415dae6bd2d8"},"headline":"Alfresco &#8211; A never ending transformation","datePublished":"2024-02-16T16:08:34+00:00","dateModified":"2026-02-25T20:27:56+00:00","mainEntityOfPage":{"@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/"},"wordCount":1043,"commentCount":0,"image":{"@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#primaryimage"},"thumbnailUrl":"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png","keywords":["AIO","Alfresco","ATS","indexing","Transform"],"articleSection":["Enterprise content management"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/","url":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/","name":"Alfresco - A never ending transformation - dbi Blog","isPartOf":{"@id":"https:\/\/www.dbi-services.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#primaryimage"},"image":{"@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#primaryimage"},"thumbnailUrl":"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png","datePublished":"2024-02-16T16:08:34+00:00","dateModified":"2026-02-25T20:27:56+00:00","author":{"@id":"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/c4d05b25843a9bc2ab20415dae6bd2d8"},"description":"An infinite transformation from the Alfresco Transform Service for text indexing is using all disk space and RAM?","breadcrumb":{"@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#primaryimage","url":"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png","contentUrl":"https:\/\/www.dbi-services.com\/blog\/wp-content\/uploads\/sites\/2\/2024\/02\/indexControl.png","width":826,"height":188},{"@type":"BreadcrumbList","@id":"https:\/\/www.dbi-services.com\/blog\/alfresco-a-never-ending-transformation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/www.dbi-services.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Alfresco &#8211; A never ending transformation"}]},{"@type":"WebSite","@id":"https:\/\/www.dbi-services.com\/blog\/#website","url":"https:\/\/www.dbi-services.com\/blog\/","name":"dbi Blog","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.dbi-services.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/c4d05b25843a9bc2ab20415dae6bd2d8","name":"Morgan Patou","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/5d7f5bec8b597db68a09107a6f5309e3870d6296ef94fb10ead4b09454ca67e5?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/5d7f5bec8b597db68a09107a6f5309e3870d6296ef94fb10ead4b09454ca67e5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5d7f5bec8b597db68a09107a6f5309e3870d6296ef94fb10ead4b09454ca67e5?s=96&d=mm&r=g","caption":"Morgan Patou"},"description":"Morgan Patou has over 12 years of experience in Enterprise Content Management (ECM) systems, with a strong focus in recent years on platforms such as Alfresco, Documentum, and M-Files. He specializes in the architecture, setup, customization, and maintenance of ECM infrastructures in complex &amp; critical environments. Morgan is well-versed in both engineering and operations aspects, including high availability design, system integration, and lifecycle management. He also has a solid foundation in open-source and proprietary technologies - ranging from Apache, OpenLDAP or Kerberos to enterprise-grade systems like WebLogic. Morgan Patou holds an Engineering Degree in Computer Science from ENSISA (\u00c9cole Nationale Sup\u00e9rieure d'Ing\u00e9nieurs Sud Alsace) in Mulhouse, France. He is Alfresco Content Services Certified Administrator (ACSCA), Alfresco Content Services Certified Engineer (ACSCE) as well as OpenText Documentum Certified Administrator. His industry experience spans the Public Sector, IT Services, Financial Services\/Banking, and the Pharmaceutical industry.","sameAs":["https:\/\/blog.dbi-services.com\/author\/morgan-patou\/","https:\/\/x.com\/MorganPatou"],"url":"https:\/\/www.dbi-services.com\/blog\/author\/morgan-patou\/"}]}},"_links":{"self":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts\/31002","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/users\/20"}],"replies":[{"embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/comments?post=31002"}],"version-history":[{"count":6,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts\/31002\/revisions"}],"predecessor-version":[{"id":43180,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts\/31002\/revisions\/43180"}],"wp:attachment":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/media?parent=31002"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/categories?post=31002"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/tags?post=31002"},{"taxonomy":"type","embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/type_dbi?post=31002"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}