0 purchases
altotools 0.1.0
ALTO Tools
:snake: tools for performing various operations on ALTO XML files
Installation
Clone the repository, enter it and run
pip install .
Usage
alto-tools <INPUT> [OPTION]
INPUT should be the path to an ALTO file or directory containing ALTO files.
Output is sent to stdout.
OPTION
Description
-t --text
Extract UTF-8 encoded text content
-c --confidence
Extract mean OCR word confidence score
-i --illustrations
Extract bounding box coordinates of <Illustration> elements
-g --graphics
Extract bounding box coordinates of <GraphicalElement> elements
For personal and professional use. You cannot resell or redistribute these repositories in their original state.
There are no reviews.