chrismattmann / etllib
This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.
☆17 · Updated last year
Alternatives and similar repositories for etllib:
Users who are interested in etllib are comparing it to the libraries listed below.
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them. ☆37 · Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms. ☆24 · Updated 3 years ago
- Web Tables Automatic Property Mapping ☆7 · Updated 5 years ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation. ☆11 · Updated 6 years ago
- A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the ne… ☆31 · Updated 7 years ago
- The OpenSextant Gazetteer is a collection of world-wide place name data ☆12 · Updated 7 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. ☆34 · Updated last year
- Files for the Karma tutorial at TCDL, Texas Conference on Digital Libraries ☆29 · Updated 8 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr ☆22 · Updated 6 years ago
- Automatically exported from code.google.com/p/tdwg-rdf ☆21 · Updated 5 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 · Updated 8 years ago
- Documents for the project Libraccess ☆13 · Updated 10 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess… ☆16 · Updated 9 years ago
- Solr Relevance Ranking Analysis and Visualization Tool ☆17 · Updated 5 years ago
- Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum ☆17 · Updated 2 years ago
- LINKED DATA QUALITY REPORTS ☆41 · Updated 2 years ago
- Mirror of Apache Stanbol (incubating) ☆112 · Updated last year
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base. ☆31 · Updated 11 years ago
- Advanced desktop search/corpus exploration prototype ☆21 · Updated 3 years ago
- Topic modeling web application ☆40 · Updated 9 years ago
- BatchRefine adds batch processing capabilities to OpenRefine ☆50 · Updated 8 years ago
- sparql-stream sensor queries ☆16 · Updated 8 years ago
- ☆22 · Updated last year
- For interacting with nutch via Python ☆25 · Updated last month
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 · Updated 3 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 · Updated 2 years ago
- An RDF Search Engine ☆57 · Updated 7 years ago
- a Simple API for RDF ☆29 · Updated 15 years ago
- Simple search results with Solr and EmberJS ☆58 · Updated 6 years ago
- ☆13 · Updated 10 years ago