doc2data
About doc2data
doc2data is a Python library that provides functionality to train deep learning models for various document processing tasks.
Currently, models can be trained for four tasks:
- Page rotation
- Page cropping
- Document (multi-page) classification
- Token classification
Please note that doc2data is currently in a prototype stage.
Installation
pip install doc2data
Documentation
The documentation can be found here.
License
doc2data
is distributed under the terms of the Apache-2.0 license.
Credits