raleighpublicrecord / dochive
Structured Data from PDF image-based files
☆87Updated 11 years ago
Related projects
Alternatives and complementary repositories for dochive
- Tools for working with Optical Character Recognition output☆16Updated 10 years ago
- A place to collect and share knowledge about liberating data from PDFs☆53Updated 2 years ago
- Docker container to provide Apache Tika RESTful API☆40Updated 8 years ago
- Discover, analyze and present data from the web and mobile in meaningful ways☆83Updated 11 years ago
- This is a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 2 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- A platform for tools that do stuff with data☆56Updated 5 years ago
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets.☆11Updated 9 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆139Updated 9 years ago
- Data storytelling. See link for detailed documentation: http://lab41.github.io/gestalt.☆20Updated 8 years ago
- Code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- Schemas and helpful handlers for OADA-related formats.☆16Updated 4 years ago
- A Python web application for converting PDF forms into PDF-filling APIs☆46Updated 3 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news"☆61Updated 3 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik☆24Updated 8 years ago
- ☆24Updated 9 years ago
- Monitor datasets, get alerts when something happens☆211Updated 5 years ago
- (DEPRECATED) Parser for U.S. federal regulations and other regulatory information☆54Updated 6 years ago
- A small Docker image built for the OCRopus OCR system.☆19Updated 6 years ago
- The news homepage archive☆81Updated 3 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- Open Data Index website☆37Updated 6 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Data Quality Dashboards display statistics on a collection of published data.☆33Updated 4 years ago
- View, visualize, clean and process data in the browser.☆148Updated 6 years ago
- Charts for the Consumer Financial Protection Bureau☆12Updated 7 months ago
- Parser for U.S. federal regulations and other regulatory information☆38Updated last year
- A suite of focused and simple tools and activities for journalists, data journalism classrooms and community advocacy groups☆62Updated 7 months ago
- Segrada - Semantic Graph Database☆68Updated last year