marmanis / yooreeka
This is the "official" site of the Yooreeka project that used to be hosted on Google Code.
☆28 Updated 11 months ago
Alternatives and similar repositories for yooreeka
Users who are interested in yooreeka compare it to the libraries listed below.
- extensible Web Retrieval Toolkit ☆17 Updated 3 years ago
- ☆25 Updated 9 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML ☆15 Updated 8 years ago
- System for mining Wikipedia Usage data to read our collective mind ☆21 Updated 10 years ago
- ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image… ☆96 Updated 6 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. ☆34 Updated 2 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity. ☆13 Updated 6 years ago
- A web based data mining workflow platform with real-time analysis capabilities ☆49 Updated 2 years ago
- General Architecture for Text Engineering ☆50 Updated 9 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies… ☆94 Updated 7 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene ☆58 Updated 12 years ago
- ☆13 Updated 9 years ago
- ☆20 Updated 8 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki) ☆54 Updated 7 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 9 years ago
- Example programs, data, and jarfiles from book "Text Processing in Java" ☆19 Updated 11 years ago
- ☆33 Updated 10 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them. ☆38 Updated last year
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool ☆149 Updated 3 years ago
- Files for the Karma tutorial at TCDL, Texas Conference on Digital Libraries ☆29 Updated 9 years ago
- Blog crawler for the blogforever project. ☆23 Updated 11 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and … ☆48 Updated 3 years ago
- Pattern-of-Behavior Search Tool ☆11 Updated 3 years ago
- Browser add-on and web server to support collection and analysis of web browsing data. ☆13 Updated 9 years ago
- A toolkit for clustering web pages based on various similarity measures. ☆33 Updated 3 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format) ☆65 Updated 9 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 10 years ago
- English Dependency Relationship Extractor ☆85 Updated 7 months ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services ☆27 Updated 2 weeks ago
- The first Open Source document analysis platform ☆65 Updated 4 years ago