jeremybarnes / jmlLinks
Jeremy's Machine Learning Library
☆52Updated 9 years ago
Alternatives and similar repositories for jml
Users that are interested in jml are comparing it to the libraries listed below
Sorting:
- distributed latent dirichlet allocation☆30Updated 14 years ago
- Database server based on leveldb storage engine☆121Updated 9 years ago
- ☆116Updated 13 years ago
- MapReduce with ZeroMQ☆121Updated 2 years ago
- Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining"☆49Updated 15 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆161Updated 3 years ago
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆508Updated 7 years ago
- playing around with the common crawl dataset☆70Updated 13 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆337Updated 14 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆101Updated 10 years ago
- Pretty fast parser for probabilistic context free grammars☆88Updated 12 years ago
- Toy single-machine implementation of the Pregel graph-based framework☆118Updated 9 years ago
- A Redis-backed storage engine for timelines☆134Updated 8 years ago
- A fast HTTP/WebSocket to zeromq gateway (UNMAINTAINED, take a look at swindon web server instead)☆247Updated 10 years ago
- Tiny data structures that pack a punch!☆101Updated 13 years ago
- A restful web application for real-time typeahead and autocomplete☆105Updated 12 years ago
- (DEPRECATED. This project is no longer used or maintained at LiveRamp.) Hank is a high performance distributed key-value NoSQL database t…☆175Updated 5 years ago
- Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop☆85Updated 12 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service☆42Updated 14 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆173Updated 13 years ago
- Social Graph Analysis using Elastic MapReduce and PyPy☆55Updated 14 years ago
- C network daemon for HyperLogLogs☆451Updated 4 years ago
- S4 repository☆141Updated 14 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆294Updated 3 years ago
- ScalienDB is a scalable, replicated datastore.☆87Updated 12 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 12 years ago
- A toy school project intended to be an approximate clone of Google's Megastore database for geographically-distributed scalable fault-to…☆35Updated 14 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Updated 14 years ago
- Unbounded stream processing in node.js☆63Updated 12 years ago