dannguyen / abbyy-finereader-ocr-senateView external linksLinks
Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms
☆135Mar 22, 2016Updated 9 years ago
Alternatives and similar repositories for abbyy-finereader-ocr-senate
Users that are interested in abbyy-finereader-ocr-senate are comparing it to the libraries listed below
Sorting:
- NICAR 2016 talk about PDFs!☆63Mar 12, 2016Updated 9 years ago
- Patterns in NYT production from 1987 to 2007☆11Nov 6, 2017Updated 8 years ago
- ☆23Mar 7, 2015Updated 10 years ago
- Investigative tool for extracting relevant areas from many documents☆14Nov 17, 2015Updated 10 years ago
- A tutorial repo for transparent and reproducible data journalism.☆14Jul 2, 2017Updated 8 years ago
- ☆14May 15, 2018Updated 7 years ago
- Handouts/Tipsheets for the 2015 Global Investigative Journalism Conference☆10Oct 9, 2015Updated 10 years ago
- Various NLP-related stuff☆10Apr 13, 2017Updated 8 years ago
- Simple library for storing Scrapy Items in sqlite database☆12Jan 28, 2016Updated 10 years ago
- ☆12May 12, 2016Updated 9 years ago
- Call in to record & dynamically generate a JSON audio feed for Alexa Flash Briefings☆14Jul 28, 2017Updated 8 years ago
- A re-useable, stand-alone version of LittleSis network storytelling tool☆12Jan 30, 2016Updated 10 years ago
- Newsclipse: The IDE for news production.☆91Dec 11, 2014Updated 11 years ago
- Turn raw electronic FEC filings into meaningful data☆19May 20, 2016Updated 9 years ago
- Code for the Deep Learning HackerEarth Challenge #1☆12Nov 1, 2017Updated 8 years ago
- Parser and standardizer for politician, individual and organization names.☆128May 18, 2017Updated 8 years ago
- Food News is Hacker News for food, built using Drum & Mezzanine. Uses Mozilla Persona for authentication.☆67May 10, 2015Updated 10 years ago
- A collection of lists of forms maintained by local, state and federal policing organizations. If you have a form name, you have a FOIA re…☆18Updated this week
- Code to package FiveThirtyEight data using Datasette☆16Nov 19, 2022Updated 3 years ago
- Python package implementing the greedy string tiling algorithm for comparing string similarity☆12Mar 20, 2023Updated 2 years ago
- A how-to do a mass collection of FEC data using the command-line and regular expressions☆29Mar 18, 2016Updated 9 years ago
- Jupyter Notebook extension to track notebook history☆10Nov 8, 2017Updated 8 years ago
- A financial disclosure data extraction tool.☆19Aug 2, 2023Updated 2 years ago
- ☆15Mar 11, 2024Updated last year
- Reverse proxy for RethinkDB☆34Sep 30, 2018Updated 7 years ago
- NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs☆12Mar 9, 2019Updated 6 years ago
- k8s-trailhead is intended to be a starting point for any engineers who are interested in interacting with Kubernetes via the Golang clien…☆16Jan 27, 2018Updated 8 years ago
- Machine Learning Hackathon organized by Hackerearth☆13Feb 2, 2016Updated 10 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- Mapping the growth of Wal-Mart in urban areas.☆15Apr 1, 2015Updated 10 years ago
- Simple Twilio conferencing system with softphone and professional recordings☆15Jan 7, 2020Updated 6 years ago
- MathJax and TeX pastebin☆14Apr 11, 2017Updated 8 years ago
- The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploit…☆745Feb 25, 2019Updated 6 years ago
- A Los Angeles Times analysis of serious assaults misclassified by LAPD☆63Oct 21, 2018Updated 7 years ago
- ☆23Dec 27, 2024Updated last year
- A PSR15 middleware dispatcher☆17Apr 30, 2019Updated 6 years ago
- Extract Stats Q/A from Tables With Provenance☆26Dec 27, 2025Updated last month
- Provide some useful utils for the php CLI. console color, CLI env, CLI code highlighter.☆20Nov 24, 2025Updated 2 months ago
- Implementation of some ideas from ggplot2 on top of d3.js☆123May 15, 2024Updated last year