kiranvodrahalli / cos521
Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.
☆12Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for cos521
- Count-Min Tree Sketch: Approximate counting for NLP☆10Updated 7 years ago
- Implementation of Bayesian Sets for fast similarity searches.☆15Updated 13 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Large scale matrix factorization on GPU☆19Updated 8 years ago
- Successor to Annoy https://github.com/spotify/annoy☆13Updated 9 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Updated 9 years ago
- A mulitarmed bandit to A/B test go projects, or other languages via an API.☆71Updated 10 years ago
- ☆26Updated 7 years ago
- ☆38Updated 8 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 10 years ago
- Golang wrapper for Vowpal Wabbit☆34Updated 10 years ago
- KDD Hands-On Tutorial (2018)☆29Updated last year
- Distributed Recommender System☆28Updated 9 years ago
- CrowdRec reference framework☆32Updated 7 years ago
- approximate streaming quantiles☆31Updated 10 years ago
- Datasets and notebooks☆13Updated 8 years ago
- HyperLogLog and other probabilistic data structures for mining in data streams☆15Updated 9 years ago
- Search for similar short strings☆53Updated 4 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆62Updated 8 years ago
- Weighted linear regression☆15Updated 3 years ago
- ☆37Updated 6 years ago
- A startup search engine made using embeddings built on crunchbase company descriptions☆11Updated 8 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 10 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Updated 13 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- Exploration Library in Java☆12Updated last year
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 10 years ago