kiranvodrahalli / cos521Links
Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.
☆12Updated 10 years ago
Alternatives and similar repositories for cos521
Users that are interested in cos521 are comparing it to the libraries listed below
Sorting:
- High performance implementations of gradient boosting, random forests, etc. in Go☆62Updated 11 years ago
- ☆37Updated 7 years ago
- Probabilistic Multiplicity Counting☆49Updated 9 years ago
- LogLog based Cardinality Estimator☆63Updated 7 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- A deep, LSTM-based part of speech tagger and sentiment analyser using character embeddings instead of words. Compatible with Theano and T…☆91Updated 8 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)☆11Updated 10 years ago
- Anomaly detection training suite☆118Updated 9 years ago
- Distributed Recommender System☆28Updated 9 years ago
- Golang wrapper for Vowpal Wabbit☆35Updated 11 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- A simple database optimized for returning results by custom scoring functions.☆20Updated 9 years ago
- A Trie data structure that allows for fuzzy string matching☆11Updated 10 years ago
- A simple library for loading word2vec binary model.☆12Updated 9 years ago
- Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams☆37Updated 8 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Compressing and Decoding Term Statistics Time Series -- ECIR 2016☆10Updated 9 years ago
- KLL sketch: Almost Optimal Streaming Quantiles☆35Updated 9 years ago
- Lightweight bootstrap testing for detecting causal impact to timeseries in Go.☆17Updated 10 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 7 years ago
- Optimal Quantile Approximation in Streams☆162Updated 2 years ago
- Query engine for TrailDB☆51Updated 6 years ago
- A mulitarmed bandit to A/B test go projects, or other languages via an API.☆71Updated 11 years ago
- a data flow graphical programming language for data science☆37Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- hokusai -- sketching streams in real-time☆77Updated 8 years ago
- Notes on Lambda Architecture☆12Updated 7 years ago
- A fasttext implementation based on Torch☆72Updated 9 years ago