kiranvodrahalli / cos521
Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.
☆12Updated 10 years ago
Alternatives and similar repositories for cos521:
Users that are interested in cos521 are comparing it to the libraries listed below
- Count-Min Tree Sketch: Approximate counting for NLP☆10Updated 7 years ago
- HyperLogLog and other probabilistic data structures for mining in data streams☆14Updated 10 years ago
- Jeremy's Machine Learning Library☆52Updated 9 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Updated 10 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 10 years ago
- ☆20Updated 8 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- hacky exploratory variants on NN language models☆9Updated 9 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- ☆26Updated 8 years ago
- Large scale matrix factorization on GPU☆19Updated 8 years ago
- various simple RNNs trained on synthetic grammars☆30Updated 9 years ago
- Python implementation of nonparametric nearest-neighbor-based estimators for divergences between distributions.☆48Updated 8 years ago
- Notes on Lambda Architecture☆12Updated 7 years ago
- Locality Sensitive Hashing using Golang and SQL database☆28Updated 8 years ago
- Datasets and notebooks☆13Updated 8 years ago
- A platform for storing large semantic networks on MongoDB☆22Updated 13 years ago
- TREC Real-Time Summarization Tools☆15Updated 7 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿☆17Updated 8 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- A board game recommendation engine/model/website.☆39Updated 8 years ago