tdunning / storm-counts
☆22 · Updated this week
Related projects:
- NLP Utilities in Java ☆43 · Updated last year
- iSAX Indexing persisted in HBase ☆39 · Updated 13 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore. ☆16 · Updated 8 years ago
- A project for code to create models from existing corpora and distribute models. ☆42 · Updated 12 years ago
- xlvector's solution to the GitHub contest ☆33 · Updated 15 years ago
- Python wrapper for the Vowpal Wabbit machine learning library. ☆53 · Updated 11 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps. ☆158 · Updated last year
- Document clustering based on Latent Semantic Analysis ☆96 · Updated 14 years ago
- Mahout vector encoding for Pig ☆54 · Updated last year
- Distributed latent Dirichlet allocation ☆30 · Updated 12 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows. ☆54 · Updated 9 years ago
- Jeremy's Machine Learning Library ☆52 · Updated 8 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix). ☆15 · Updated 4 years ago
- A Hadoop toolkit for web-scale information retrieval research ☆79 · Updated 9 years ago
- Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining" ☆49 · Updated 14 years ago
- Simple simhashing in Hadoop with Cascading ☆33 · Updated 13 years ago
- Website for standardized execution and evaluation of algorithms on datasets. ☆36 · Updated 4 years ago
- Parallel Algorithms in Python for Hadoop/MapReduce ☆56 · Updated 12 years ago
- Kairos combines a focused crawler and an information extraction engine to convert a list of conference websites into an index filled wit… ☆18 · Updated 13 years ago
- Java implementation of Microsoft's AdPredictor algorithm ☆17 · Updated 6 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms. ☆42 · Updated 11 years ago
- Machine learning and natural language processing with Apache Pig ☆53 · Updated 10 years ago
- Toy single-machine implementation of the Pregel graph-based framework ☆112 · Updated 7 years ago
- Crux is a reporting application for HBase. Crux provides a simple web-based graphical interface to access HBase, query data and create re… ☆100 · Updated 11 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features. ☆13 · Updated 6 years ago
- Distributed Matrix Library ☆70 · Updated 7 years ago