marmanis / yooreekaLinks
This is the "official" site of the Yooreeka project that used to be hosted on Google Code.
☆28Updated last year
Alternatives and similar repositories for yooreeka
Users that are interested in yooreeka are comparing it to the libraries listed below
Sorting:
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 3 years ago
- Dynamic data analysis over the web. The logic to your data dashboards.☆74Updated 9 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- extensible Web Retrieval Toolkit☆17Updated 3 years ago
- Experimental parallel data analysis toolkit.☆122Updated 4 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆159Updated 11 years ago
- Example programs, data, and jarfiles from book "Text Processing in Java"☆19Updated 11 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- ☆33Updated 11 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 4 years ago
- Trending on Accumulo☆40Updated 13 years ago
- personal cheatsheets on various technologies☆25Updated 9 years ago
- Graph Analytics Engine☆260Updated 11 years ago
- Storm / Solr Integration☆19Updated last year
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Updated last year
- Tuffy, a Markov Logic Network solver☆26Updated 11 years ago
- Text Mining Library with a focus on Latent Semantic Analysis☆12Updated 12 years ago
- Public code files for the DDL blog☆56Updated 7 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 5 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)☆54Updated 8 years ago
- A protovis visualization of the linked open data cloud.☆26Updated 14 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆57Updated 13 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18Updated 7 months ago
- A toolkit for clustering web pages based on various similarity measures.☆34Updated 4 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- Old and outdated version of RapidMiner Studio 5. See rapidminer-studio for the latest version 7.x☆122Updated 11 years ago
- ☆20Updated 9 years ago
- ☆11Updated 8 years ago