dannguyen / abbyy-finereader-ocr-senateLinks
Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms
☆134Updated 9 years ago
Alternatives and similar repositories for abbyy-finereader-ocr-senate
Users that are interested in abbyy-finereader-ocr-senate are comparing it to the libraries listed below
Sorting:
- A collection of tools for mining government data☆141Updated 9 years ago
- Extract tables from PDF files☆359Updated 9 years ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database☆161Updated 9 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆112Updated 10 years ago
- A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions☆311Updated 9 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- Keshif - Data Made Explorable (Prototype)☆457Updated 8 years ago
- Tool for visual exploration of complex data.☆193Updated 7 years ago
- Create simple APIs from CSV files☆195Updated 5 years ago
- We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components…☆109Updated 6 years ago
- ☆89Updated 10 years ago
- A framework for visualizing parent-child relationships with d3js☆116Updated 7 years ago
- Download Hillary Clinton's emails and query them with sqlite☆153Updated 5 years ago
- A Python web application for converting PDF forms into PDF-filling APIs☆48Updated 4 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- TensorFlow for AWS☆116Updated 10 years ago
- Analyzes a CSV file and generates database table schema, all within the browser☆315Updated 9 years ago
- Repository for PyCon 2016 workshop Natural Language Processing in 10 Lines of Code☆240Updated 8 years ago
- Extract tabular data and semantically discover it with ease! (OS)☆21Updated 9 years ago
- Mechanical Turk on your own machine.☆208Updated last year
- Loan-level analysis of Fannie Mae and Freddie Mac data☆219Updated 5 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆242Updated 2 weeks ago
- Document processing for investigations☆250Updated 8 years ago
- make it easy to turn a lot of potentially large csv files into easily accessible open data☆198Updated 9 years ago
- Human-Powered Data Analysis with Mechanical Turk☆300Updated 12 years ago
- online natural language processing with word vectors☆310Updated last year
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆390Updated 2 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- Examples for http://dataviztalk.blogspot.com☆21Updated 9 years ago