mattfullerton / tika-tesseract-dockerLinks
Docker container to provide Apache Tika RESTful API
☆41Updated 9 years ago
Alternatives and similar repositories for tika-tesseract-docker
Users that are interested in tika-tesseract-docker are comparing it to the libraries listed below
Sorting:
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆136Updated 10 years ago
- ☆24Updated 10 years ago
- View, visualize, clean and process data in the browser.☆146Updated 7 years ago
- Guides and introductions for participating in Labs and some of its projects.☆170Updated 9 years ago
- Create and validate Data Packages in the browser☆27Updated 3 years ago
- CFPB's streaming batch geocoder☆36Updated 9 years ago
- Transform any dataset into an HTTP API with The DataTank☆82Updated 5 years ago
- CSV grooming, the JS way☆21Updated 6 years ago
- 'Git for Tabular Data'☆46Updated 9 years ago
- Free-form web data notebook - "Data management for little guys"☆26Updated 5 months ago
- Structured Data from PDF image-based files☆89Updated 12 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆58Updated 3 years ago
- Where things are (and what they mean) in Who's On First.☆31Updated 5 months ago
- Open source large document set visualization platform☆270Updated 2 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 10 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- CKAN Resource View to build maps and choropleth maps☆27Updated 2 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Schemas and helpful handlers for OADA-related formats.☆16Updated 5 years ago
- [DEPRECATED] Please use https://goodtables.io☆13Updated 9 years ago
- Breve☆29Updated 6 years ago
- A place to collect and share knowledge about liberating data from PDFs☆55Updated 3 years ago
- Tools for text tokenization and encoding☆84Updated 4 years ago
- OpenSpending js library and mini-apps including visualizations☆35Updated 9 years ago
- display urls being tweeted with an event hashtag☆18Updated 9 years ago
- Make for data☆21Updated 7 years ago
- [DEPRECATED] Please use https://datahub.io/docs/features/data-cli☆109Updated 7 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- The Frictionless Data website.☆31Updated 5 years ago