- 04 Jun, 2021 1 commit
-
-
Jérome Perrin authored
With tesseract v4.0.0-beta.3 we often observe crashes with: ``` contains_unichar_id(unichar_id):Error:Assert failed:in file ../../src/ccutil/unicharset.h, line 511 ``` This seems to have been fixed by https://github.com/tesseract-ocr/tesseract/pull/1954 Still, even after updating to 4.1.1, text recognition from PDF in ERP5 is too expensive. We also update Ghostscript to 9.54.0, because this version has built-in OCR, which does not need to convert the PDF to PNG then TIFF as we currently do in ERP5. See merge request nexedi/slapos!985
-
- 03 Jun, 2021 2 commits
-
-
Thomas Gambier authored
-
Jérome Perrin authored
This tessdata path will be included in cpp code by pre-processor macros https://github.com/ArtifexSoftware/ghostpdl/blob/gs9.54.0/base/tessocr.cpp#L188-L193 Since // is the marker for a comment in cpp and as documented in https://gcc.gnu.org/onlinedocs/cpp/Stringizing.html "Comments are replaced by whitespace long before stringizing happens, so they never appear in stringized text", the STRINGIFY/STRINGIFY2 approach of including a path does not work when the path contain // , because anything after // is considered a comment and is not included, causing errors like this when using ghostscript with OCR in webrunner: $ strace -e open -o open.strace /srv/slapgrid/slappart42/srv/runner/shared/ghostscript/4387fe7a8d2034ac5691d43b58134248/bin/gs -sDEVICE=ocr GPL Ghostscript 9.54.0 (2021-03-30) Copyright (C) 2021 Artifex Software, Inc. All rights reserved. This software is supplied under the GNU AGPLv3 and comes with NO WARRANTY: see the file COPYING for details. Error opening data file ./eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory. Failed loading language 'eng' Tesseract couldn't load any languages! **** Unable to open the initial device, quitting. $ grep eng open.strace open("./eng.traineddata", O_RDONLY) = -1 ENOENT (No such file or directory) open("/srv/slapgrid/slappart42/srv/eng.traineddata", O_RDONLY) = -1 ENOENT (No such file or directory) open("eng.traineddata", O_RDONLY) = -1 ENOENT (No such file or directory) eng.traineddata is looked up in /srv/slapgrid/slappart42/srv/ because ghostscript was configured with: --with-tessdata=/srv/slapgrid/slappart42/srv//runner//shared/ghostscript/4387fe7a8d2034ac5691d43b58134248/share/tessdata/ and everything after // was stripped. This was reported upstream as https://bugs.ghostscript.com/show_bug.cgi?id=703905 More about the case of // in slaprunner paths was on commit eb544196 (slparunner: document the reasons why we keep srv//slaprunner, 2019-10-10)
-
- 02 Jun, 2021 4 commits
-
-
Xavier Thompson authored
Before this commit, instance parameters that have a default null value in JSON format were rendered as 'None' string in instance-theia.cfg. In particular, Theia frontends were always requested with unsolvable SLA parameters that prevented a frontend from ever being allocated: ``` sla-instance_guid = None ```
-
Łukasz Nowak authored
proxy.config.snapshot_dir is unavailable since TrafficServer 8.
-
Łukasz Nowak authored
-
Łukasz Nowak authored
-
- 31 May, 2021 7 commits
-
-
Kirill Smelkov authored
See 9c5d25cb; sorry for missing to update the duplicate in that patch.
-
Kirill Smelkov authored
In commit 48b24182 I passed 'wcfs_enable' to instance-zope.cfg as string instead of an object. As the result, even if it was wcfs_enable=False, in instance-zope.cfg in ---- 8< ---- https://lab.nexedi.com/nexedi/slapos/blob/8d3af101/stack/erp5/instance-zope.cfg.in#L68-72) {% if slapparameter_dict['wcfs_enable'] %} WENDELIN_CORE_VIRTMEM=r:wcfs+w:uvmm {% else %} WENDELIN_CORE_VIRTMEM=rw:uvmm {% endif %} it was always taking the first branch, becuse slapparameter_dict['wcfs_enable'] was giving 'False' - a string - whose boolean value is True In [1]: bool('False') Out[1]: True -> Fix it by passing slapparameter_dict['wcfs_enable'], similarly to most other keys, as object.
-
Kirill Smelkov authored
Due to the typo in 6572375c the revision was not unset and so it was wendelin.core 1 that was still being built and installed.
-
Łukasz Nowak authored
-
Łukasz Nowak authored
All data related to the instance shall be in the instance itself, and as temporary files are crucial for caucase, force the macro user to set the location.
-
Jérome Perrin authored
This is required for example to preview images (ie. open a .png file from the editor). Without the default value of {{uuid}}.mini-browser.{{hostname}} is used but we don't have anything in place to resolve such domain names.
-
Xavier Thompson authored
See merge request nexedi/slapos!990
-
- 28 May, 2021 2 commits
-
-
Kirill Smelkov authored
Going Go1.15.11 -> Go1.15.12 brings in fixes to compiler, runtime and stdlib including security fix to net/http package: https://golang.org/doc/devel/release.html#go1.15.minor Tested on helloworld SR (by adjusting it to use go1.15 instead of go1.16).
-
Kirill Smelkov authored
Going Go1.16.3 -> Go1.16.4 brings in fixes to compiler, runtime and stdlib including security fix to net/http package: https://golang.org/doc/devel/release.html#go1.16.minor Tested on helloworld SR.
-
- 27 May, 2021 6 commits
-
-
Rafael Monnerat authored
-
Xavier Thompson authored
Before this commit, the Go workspace was always activated before launching the Theia shell and starting the embedded supervisord. This in turn added the gcc installed by Theia to the PATH, meaning that nested instances would be compiled with Theia's gcc instead of the system gcc, making the compilation outcome dependant on Theia. With this commit, the gcc resolved by PATH is now the system one.
-
Xavier Thompson authored
See merge request nexedi/slapos!987
-
Xavier Thompson authored
This option takes a string representing a valid JSON object and forwards the corresponding python dict to the embedded instance.
-
Xavier Thompson authored
-
Xavier Thompson authored
-
- 26 May, 2021 4 commits
-
-
Jérome Perrin authored
This will be used to do OCR from PDF documents
-
Jérome Perrin authored
This version includes an OCR engine based on tesseract.
-
Jérome Perrin authored
Also enable shared parts and provide new languages: Simplied Chinese, Japanese and French
-
Jérome Perrin authored
-
- 25 May, 2021 2 commits
-
-
Xavier Thompson authored
-
Łukasz Nowak authored
Features: * random response is possible of big sizes * timeout can be randomized
-
- 24 May, 2021 1 commit
-
-
Xavier Thompson authored
See merge request nexedi/slapos!979
-
- 21 May, 2021 3 commits
-
-
Julien Muchembled authored
-
Thomas Gambier authored
This fixes 157dd7ab.
-
Jérome Perrin authored
-
- 20 May, 2021 2 commits
-
-
Thomas Gambier authored
-
Łukasz Nowak authored
It workarounds stack/resilient/pbsready.cfg.in directory:backup string concatenation: backup = $${basedirectory:backup}/$${slap-parameter:namebase}
-
- 19 May, 2021 3 commits
-
-
Xavier Thompson authored
-
Xavier Thompson authored
-
Xavier Thompson authored
-
- 17 May, 2021 1 commit
-
-
Xavier Thompson authored
See merge request !981
-
- 16 May, 2021 1 commit
-
-
Julien Muchembled authored
This fixes: Installing jupyter. While: Installing jupyter. Base installation request: ... Getting distribution for 'argon2-cffi'. Error: Picked: argon2-cffi = 20.1.0 Addition of argon2-cffi in commit 7d1ea024 was a last-minute change to fix jupyter SR.
-
- 15 May, 2021 1 commit
-
-
Julien Muchembled authored
-