usnistgov / ocr-pipeline
Convert a corpus of PDF to clean text files on a distributed architecture
☆38 · Mar 5, 2024 · Updated last year
Alternatives and similar repositories for ocr-pipeline
Users who are interested in ocr-pipeline are comparing it to the repositories listed below.
- This is a Django project template using uWSGI as application server. ☆10 · May 15, 2019 · Updated 6 years ago
- Trading Consequences data and code ☆15 · Mar 5, 2015 · Updated 10 years ago
- Named Entity Recognition tool for Europeana Newspapers ☆14 · Apr 5, 2018 · Updated 7 years ago
- Watching the SCOTUS ☆178 · Oct 7, 2015 · Updated 10 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED] ☆25 · Sep 14, 2016 · Updated 9 years ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 · Mar 18, 2024 · Updated last year
- Want to learn more about Free Law Project technologies, policies and thinking? Get the literature here. ☆25 · Jul 6, 2021 · Updated 4 years ago
- Pure python script that takes user query and summarizes news related to it. ☆25 · Jul 6, 2022 · Updated 3 years ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images. ☆24 · Sep 24, 2015 · Updated 10 years ago
- Destiny 2 weapon rolls for DIM ☆12 · Updated this week
- ☆25 · Oct 9, 2022 · Updated 3 years ago
- Rails application supporting the creation of OCR and the IIIF Content Search API