mattfullerton / tika-tesseract-dockerLinks
Docker container to provide Apache Tika RESTful API
☆41Updated 9 years ago
Alternatives and similar repositories for tika-tesseract-docker
Users that are interested in tika-tesseract-docker are comparing it to the libraries listed below
Sorting:
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆136Updated 10 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Create and validate Data Packages in the browser☆27Updated 3 years ago
- Transform any dataset into an HTTP API with The DataTank☆82Updated 5 years ago
- Free-form web data notebook - "Data management for little guys"☆26Updated 3 months ago
- 'Git for Tabular Data'☆46Updated 9 years ago
- Guides and introductions for participating in Labs and some of its projects.☆170Updated 8 years ago
- View, visualize, clean and process data in the browser.☆147Updated 7 years ago
- CSV grooming, the JS way☆21Updated 6 years ago
- ☆24Updated 10 years ago
- display urls being tweeted with an event hashtag☆18Updated 9 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 8 years ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…☆19Updated 10 years ago
- ☆25Updated 9 years ago
- Breve☆29Updated 6 years ago
- [DEPRECATED] Please use https://goodtables.io☆13Updated 8 years ago
- [DEPRECATED] Please use http://try.goodtables.io/☆15Updated 7 years ago
- CKAN Resource View to build maps and choropleth maps☆27Updated 2 years ago
- International legislative data specifications☆101Updated 2 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Updated 3 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- A library for making web services that make functions available as synchronous or asynchronous jobs☆21Updated last year
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Automatically exported from code.google.com/p/linked-data-api☆58Updated 6 years ago
- A design prototype for DocNow to learn with☆14Updated 8 years ago
- CFPB's streaming batch geocoder☆36Updated 8 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Updated 10 years ago
- An extension to Google Refine that enables graphical mapping of Google Refine project data to an RDF skeleton and then exporting it in RD…☆95Updated last year
- Uduvudu☆17Updated 5 years ago