This section outlines the steps required to install ImageDataExtractor.
We strongly advise installing ImageDataExtractor inside a virtual environment. (Click here to learn how.)
ImageDataExtractor currently uses Tesseract 3 for text recognition. You can check your existing version by running:
$ tesseract -v
If you need a different version, the Tesseract 3 source code can be downloaded here, and instructions for compiling it on your machine can be found here.
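The version check above can also be scripted. The following is a minimal sketch, assuming the first line of the `tesseract -v` output has the form `tesseract 3.05.02`; the helper name `tesseract_major` is hypothetical and is not part of Tesseract or ImageDataExtractor.

```shell
# Hypothetical helper: print the major version number found on the first
# line of the text passed as the first argument.
tesseract_major() {
    # Expected input: "tesseract 3.05.02". Some builds print a "v" before
    # the number, so strip it before taking the part before the first dot.
    printf '%s\n' "$1" | head -n 1 | awk '{print $2}' | sed 's/^v//' | cut -d. -f1
}

if command -v tesseract >/dev/null 2>&1; then
    # Older releases write the version banner to stderr, so merge the streams.
    major=$(tesseract_major "$(tesseract -v 2>&1)")
    if [ "$major" = "3" ]; then
        echo "Tesseract 3 found"
    else
        echo "Found Tesseract major version ${major}; ImageDataExtractor expects 3"
    fi
else
    echo "tesseract is not on PATH"
fi
```

If the script reports a different major version, or that Tesseract is missing, install Tesseract 3 before continuing.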
Installation with pip
Installing with pip is the simplest way to get started with ImageDataExtractor.
Simply run:
pip install ImageDataExtractor
Then download the data files that ChemDataExtractor-IDE needs by running:
cde data download
and you're ready to go!
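As a quick sanity check, the command-line tools used in this section can be confirmed to be on your PATH. This is only a sketch: `report_tool` is a hypothetical helper, and finding a command does not prove that the data files were downloaded successfully.

```shell
# Hypothetical helper: report whether a command is available on PATH.
report_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found"
    else
        echo "$1: missing"
    fi
}

# The three commands used in this section.
report_tool tesseract
report_tool pip
report_tool cde
```

Any tool reported as missing points to the step that needs repeating, for example re-activating the virtual environment in which ImageDataExtractor was installed.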