raleighpublicrecord / dochiveLinks
Structured Data from PDF image-based files
☆88Updated 12 years ago
Alternatives and similar repositories for dochive
Users that are interested in dochive are comparing it to the libraries listed below
Sorting:
- A platform for tools that do stuff with data☆56Updated 6 years ago
- Tools for working with Optical Character Recognition output☆16Updated 11 years ago
- A Relaxed Schema Graph Database Management System☆52Updated 5 years ago
- A place to collect and share knowledge about liberating data from PDFs☆54Updated 3 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets.☆11Updated 10 years ago
- Schemas and helpful handlers for OADA-related formats.☆16Updated 4 years ago
- LIME (Language Independent Markup Editor)☆31Updated 6 years ago
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 2 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆137Updated 9 years ago
- (DEPRECATED) Web interface for viewing U.S. federal regulations and other regulatory information☆28Updated 6 years ago
- Just like on ScraperWiki Classic; now a part of QuickCode.☆38Updated 8 years ago
- Data storytelling. See link for detailed documentations: http://lab41.github.io/gestalt.☆20Updated 8 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis…☆33Updated 3 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Compile Yahoo! Pipes to Javascript (Node.js)☆44Updated 12 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- The news homepage archive☆80Updated 3 years ago
- Ideas for (tech) stuff to research, build or work on.☆50Updated 5 months ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 2 months ago
- View, visualize, clean and process data in the browser.☆148Updated 7 years ago
- Provide RESTful access to SKOS vocabularies☆58Updated 2 years ago
- Whippersnapper is an automated screenshot tool to keep a visual history of content on the web.☆55Updated 9 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆82Updated 11 years ago
- generate rules from lists of words☆16Updated 3 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆39Updated last year
- Scan a folder of document files of all types and extract the text into a CSV suitable for Overview☆26Updated 9 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆68Updated 6 years ago