mattfullerton / tika-tesseract-docker
Docker container to provide Apache Tika RESTful API
☆40Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for tika-tesseract-docker
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 6 years ago
- Simplifying the process of launching an open data repository. [RETIRED]☆20Updated 9 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆139Updated 9 years ago
- [DEPRECATED] Please use https://goodtables.io☆12Updated 8 years ago
- ☆24Updated 9 years ago
- Free-form web data notebook - "Data management for little guys"☆25Updated last year
- CoVE is an web application to Convert, Validate and Explore data following certain open data standards - including 360Giving, Open Contra…☆43Updated 2 weeks ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- Create and validate Data Packages in the browser☆27Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Using social media to steer web archiving and curation.☆15Updated 8 years ago
- LINKED DATA QUALITY REPORTS☆41Updated 2 years ago
- The OpenSextant Gazetteer is a collection of world-wide place name data☆12Updated 6 years ago
- A platform for tools that do stuff with data☆56Updated 5 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- Data Pipes for CSV☆117Updated last year
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- BatchRefine adds batch processing capabilities to OpenRefine☆50Updated 7 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 2 years ago
- iServe is what we refer to as service warehouse which unifies service publication, analysis, and discovery through the use of lightweigh…☆23Updated 8 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 9 years ago
- Command line tool used for installing, updating and configuring an Open Semantic Framework instance☆45Updated 6 years ago
- Epimorphics implementation of the Linked Data API☆53Updated 3 years ago
- Segrada - Semantic Graph Database☆68Updated last year
- Exploring power and influence in the European Union by combining information from a variety of official EU data sources related to lobbyi…☆37Updated 8 years ago
- CSV grooming, the JS way☆21Updated 5 years ago
- Uduvudu☆17Updated 4 years ago