tabulapdf / tabula-extractorLinks
Extract tables from PDF files
☆359Updated 9 years ago
Alternatives and similar repositories for tabula-extractor
Users that are interested in tabula-extractor are comparing it to the libraries listed below
Sorting:
- Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms☆135Updated 9 years ago
- A library for extracting tables from PDF files☆89Updated 12 years ago
- make it easy to turn a lot of potentially large csv files into easily accessible open data☆198Updated 9 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- Extract tables from PDF pages.☆298Updated 5 years ago
- Command line tool for deduplicating CSV files☆432Updated 5 years ago
- Open source large document set visualization platform☆270Updated 3 years ago
- (DEPRECATED) Parser for U.S. federal regulations and other regulatory information☆55Updated 7 years ago
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆261Updated 9 years ago
- File format conversion tools☆292Updated 5 months ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database☆161Updated 10 years ago
- Loan-level analysis of Fannie Mae and Freddie Mac data☆219Updated 5 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆111Updated 10 years ago
- NICAR 2016 talk about PDFs!☆63Updated 9 years ago
- Keshif - Data Made Explorable (Prototype)☆455Updated 8 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 10 years ago
- PostgreSQL schema and import scripts for recent US Census data☆117Updated 11 years ago
- Subscribe to your city.☆170Updated 3 years ago
- Analyzes a CSV file and generates database table schema, all within the browser☆316Updated 9 years ago
- API for accessing US data sets☆233Updated 3 years ago
- Use Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data p…☆97Updated 8 years ago
- A proofreader for your data☆695Updated 2 years ago
- Create simple APIs from CSV files☆195Updated 5 years ago
- ScraperWiki Python library for scraping and saving data; in maintenance mode☆158Updated this week
- CFPB's streaming batch geocoder☆36Updated 9 years ago
- Structured Data from PDF image-based files☆90Updated 12 years ago
- Guides and introductions for participating in Labs and some of its projects.☆171Updated 9 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆125Updated 4 years ago
- A repository of journalist's lookup tables.☆107Updated 8 years ago
- Easily crowdsource the analysis of your documents☆102Updated 8 years ago