marmanis / yooreeka
This is the "official" site of the Yooreeka project that used to be hosted on Google Code.
☆28Updated 6 months ago
Alternatives and similar repositories for yooreeka:
Users that are interested in yooreeka are comparing it to the libraries listed below
- Storm / Solr Integration☆19Updated last year
- The (overall) documentation of the d:swarm platform (https://github.com/dswarm/dswarm-documentation/wiki)☆21Updated 9 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- ☆20Updated 7 years ago
- ☆24Updated 9 years ago
- ☆25Updated 9 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum☆17Updated 10 years ago
- Demo application for GRADOOP operators☆23Updated 4 years ago
- Dynamic data analysis over the web. The logic to your data dashboards.☆74Updated 9 years ago
- Hadoop Ecosystem Builder: Build, package, test and deploy your Hadoop ecosystem project.☆28Updated 9 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated last month
- Spring integration with Stardog RDF database☆17Updated last month
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆157Updated 11 years ago
- ☆15Updated 7 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆37Updated 11 months ago
- ☆19Updated 7 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- t test☆10Updated 10 years ago
- This is an example project that shows one way to build a RESTful Java web app around Titan, Cassandra, and Elasticsearch.☆35Updated 9 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 4 years ago
- Text Mining Library with a focus on Latent Semantic Analysis☆12Updated 11 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- A (comprehensive) collection of open source tools used by the data community.☆51Updated 9 years ago
- Katta - distributed Lucene☆60Updated 11 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago
- The first Open Source document analysis platform☆65Updated 3 years ago
- A toolkit for clustering web pages based on various similarity measures.☆33Updated 3 years ago