mattfullerton / tika-tesseract-docker
Docker container to provide Apache Tika RESTful API
☆40 Updated 8 years ago
Alternatives and similar repositories for tika-tesseract-docker:
Users who are interested in tika-tesseract-docker are comparing it to the libraries listed below:
- Data Pipes for CSV ☆117 Updated 2 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages. ☆30 Updated 6 years ago
- ☆24 Updated 9 years ago
- A small Docker built for the OCRopus OCR system. ☆19 Updated 7 years ago
- Simplifying the process of launching an open data repository. [RETIRED] ☆20 Updated 10 years ago
- [DEPRECATED] Please use https://goodtables.io ☆13 Updated 8 years ago
- iServe is what we refer to as service warehouse which unifies service publication, analysis, and discovery through the use of lightweigh… ☆23 Updated 8 years ago
- A platform for tools that do stuff with data ☆56 Updated 5 years ago
- [DEPRECATED] Please use http://try.goodtables.io/ ☆15 Updated 7 years ago
- Make for data ☆20 Updated 6 years ago
- View, visualize, clean and process data in the browser. ☆148 Updated 6 years ago
- CLI tool for importing entities from Wikidata / Wikibase ☆23 Updated 2 years ago
- Uduvudu ☆17 Updated 4 years ago
- (DEPRECATED) Parser for U.S. federal regulations and other regulatory information ☆54 Updated 6 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu… ☆138 Updated 9 years ago
- A Relaxed Schema Graph Database Management System ☆52 Updated 4 years ago
- The Frictionless Data website. ☆31 Updated 4 years ago
- The OpenSextant Gazetteer is a collection of world-wide place name data ☆12 Updated 6 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 Updated 9 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs ☆17 Updated 7 years ago
- Linked Data Browser ☆47 Updated 8 years ago
- A place to collect and share knowledge about liberating data from PDFs ☆54 Updated 3 years ago
- Google Refine extension for adding columns (extending data) from DBpedia ☆39 Updated 11 years ago
- Using social media to steer web archiving and curation. ☆15 Updated 9 years ago
- OpenSpending js library and mini-apps including visualizations ☆35 Updated 9 years ago
- A library for making web services that make functions available as synchronous or asynchronous jobs ☆21 Updated last year
- Create and validate Data Packages in the browser ☆27 Updated 3 years ago
- Free-form web data notebook - "Data management for little guys" ☆26 Updated last year
- A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the ne… ☆31 Updated 7 years ago
- LINKED DATA QUALITY REPORTS ☆41 Updated 2 years ago