kiranvodrahalli / cos521Links
Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.
☆12Updated 10 years ago
Alternatives and similar repositories for cos521
Users that are interested in cos521 are comparing it to the libraries listed below
Sorting:
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 10 years ago
- HyperLogLog and other probabilistic data structures for mining in data streams☆14Updated 10 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Updated 10 years ago
- Count-Min Tree Sketch: Approximate counting for NLP☆9Updated 7 years ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Updated 10 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Large scale matrix factorization on GPU☆19Updated 9 years ago
- Using Word2Vec on lists and sets☆34Updated 3 weeks ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- Simple factoid question answering system☆23Updated 9 years ago
- Jeremy's Machine Learning Library☆52Updated 9 years ago
- CrowdRec reference framework☆32Updated 8 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- ☆26Updated 8 years ago
- Datasets and notebooks☆13Updated 8 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 8 years ago
- Code for KDD 2014☆16Updated 10 years ago
- ☆25Updated 9 years ago
- ☆20Updated 8 years ago
- Common Code Workflow tutorial on Theano☆16Updated 9 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆28Updated 10 years ago
- various simple RNNs trained on synthetic grammars☆30Updated 9 years ago