kiranvodrahalli / cos521Links
Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.
☆12Updated 10 years ago
Alternatives and similar repositories for cos521
Users that are interested in cos521 are comparing it to the libraries listed below
Sorting:
- HyperLogLog and other probabilistic data structures for mining in data streams☆14Updated 10 years ago
- A mulitarmed bandit to A/B test go projects, or other languages via an API.☆71Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- Count-Min Tree Sketch: Approximate counting for NLP☆9Updated 8 years ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Updated 10 years ago
- Code for KDD 2014☆16Updated 10 years ago
- Using Word2Vec on lists and sets☆34Updated last month
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- approximate streaming quantiles☆31Updated 11 years ago
- Large scale matrix factorization on GPU☆19Updated 9 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals☆102Updated 4 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Jeremy's Machine Learning Library☆52Updated 9 years ago
- Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)☆119Updated 11 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Updated 10 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- Seldon Spark Jobs☆26Updated 10 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- A board game recommendation engine/model/website.☆39Updated 8 years ago
- CrowdRec reference framework☆32Updated 8 years ago
- A deep, LSTM-based part of speech tagger and sentiment analyser using character embeddings instead of words. Compatible with Theano and T…☆91Updated 8 years ago
- Yet another regression toolkit☆12Updated 11 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- Power-Law Distribution Analysis☆26Updated 6 years ago
- Successor to Annoy https://github.com/spotify/annoy☆13Updated 9 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 8 years ago