Skip to content

resegment: running for 155 minutes(?)... #73

@jbarth-ubhd

Description

@jbarth-ubhd

and still running.

Workflow:

. /usr/local/ocrd_all/venv/bin/activate
export TMPDIR=/dwork/tmp
export LD_LIBRARY_PATH=/usr/local/lib:$LD_LIBRARY_PATH
ocrd-create-mets.xml
( /usr/bin/time ocrd process \
"olena-binarize -I OCR-D-IMG -O OCR-D-N1 -P impl wolf" \
"anybaseocr-crop -I OCR-D-N1 -O OCR-D-N2" \
"olena-binarize -I OCR-D-N2 -O OCR-D-N3 -P impl wolf" \
"cis-ocropy-denoise -I OCR-D-N3 -O OCR-D-N4 -P level-of-operation page" \
"cis-ocropy-deskew -I OCR-D-N4 -O OCR-D-N5 -P level-of-operation page" \
"pc-segmentation -I OCR-D-N5 -O OCR-D-N6" \
"cis-ocropy-deskew -I OCR-D-N6 -O OCR-D-N7 -P level-of-operation region" \
"tesserocr-segment-line -I OCR-D-N7 -O OCR-D-N8" \
"cis-ocropy-resegment -I OCR-D-N8 -O OCR-D-N9" \
"cis-ocropy-dewarp -I OCR-D-N9 -O OCR-D-N10" \
"calamari-recognize -I OCR-D-N10 -O OCR-D-OCR -P checkpoint /usr/local/ocrd_models/calamari/calamari_models-0.3/fraktur_19th_century/*.ckpt.json"

) >cmd.log 2>&1
ps axf
ls       66073  0.0  0.0   4384   744 pts/0    S    14:40   0:00                                  |   \_ /usr/bin/time ocrd process olena-binarize -I O[44/1843]
-O OCR-D-N1 -P impl wolf anybaseocr-crop -I OCR-D-N1 -O OCR-D-N2 olena-binarize -I OCR-D-N2 -O OCR-D-N3 -P impl wolf cis-ocropy-denoise -I OCR-D-N3 -O OCR-D-N4
-P level-of-operation page cis-ocropy-deskew -I OCR-D-N4 -O OCR-D-N5 -P level-of-operation page pc-segmentation -I OCR-D-N5 -O OCR-D-N6 cis-ocropy-deskew -I OCR
-D-N6 -O OCR-D-N7 -P level-of-operation region tesserocr-segment-line -I OCR-D-N7 -O OCR-D-N8 cis-ocropy-resegment -I OCR-D-N8 -O OCR-D-N9 cis-ocropy-dewarp -I
OCR-D-N9 -O OCR-D-N10 calamari-recognize -I OCR-D-N10 -O OCR-D-OCR -P checkpoint /usr/local/ocrd_models/calamari/calamari_models-0.3/fraktur_19th_century/*.ckpt
.json
ls       66074  0.0  0.0 2423620 68968 pts/0   S    14:40   0:05                                  |       \_ /dwork/ocrd-schroot-ubuntu-eoan/usr/local/ocrd_all/
venv/bin/python3.7 /dwork/ocrd-schroot-ubuntu-eoan/usr/local/ocrd_all/venv/bin/ocrd process olena-binarize -I OCR-D-IMG -O OCR-D-N1 -P impl wolf anybaseocr-crop
 -I OCR-D-N1 -O OCR-D-N2 olena-binarize -I OCR-D-N2 -O OCR-D-N3 -P impl wolf cis-ocropy-denoise -I OCR-D-N3 -O OCR-D-N4 -P level-of-operation page cis-ocropy-de
skew -I OCR-D-N4 -O OCR-D-N5 -P level-of-operation page pc-segmentation -I OCR-D-N5 -O OCR-D-N6 cis-ocropy-deskew -I OCR-D-N6 -O OCR-D-N7 -P level-of-operation
region tesserocr-segment-line -I OCR-D-N7 -O OCR-D-N8 cis-ocropy-resegment -I OCR-D-N8 -O OCR-D-N9 cis-ocropy-dewarp -I OCR-D-N9 -O OCR-D-N10 calamari-recognize
 -I OCR-D-N10 -O OCR-D-OCR -P checkpoint /usr/local/ocrd_models/calamari/calamari_models-0.3/fraktur_19th_century/*.ckpt.json
ls        2747  116  0.3 11505348 519324 pts/0 Rl   16:44 160:53                                  |           \_ /dwork/ocrd-schroot-ubuntu-eoan/usr/local/ocrd_
all/venv/bin/python3.7 /dwork/ocrd-schroot-ubuntu-eoan/usr/local/ocrd_all/venv/bin/ocrd-cis-ocropy-resegment --working-dir /_digi8+9/digitalisate8/ocr-d/testset
/x,pc-segmentation,tesserocr-segment-line,calamari-frak19th --mets mets.xml --input-file-grp OCR-D-N8 --output-file-grp OCR-D-N9 --parameter {"dpi": 0, "min_fra
ction": 0.8, "extend_margins": 3}

@bertsky: same image set as in last email.

PS: no cis-ocropy-clip for obvious reasons :-)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions