sjyk / datacleaning-benchmark
☆38 Updated 8 years ago
Related projects
Alternatives and complementary repositories for datacleaning-benchmark
- Simplified tree-based classifier and regressor for interpretable machine learning (scikit-learn compatible) ☆47 Updated 3 years ago
- KDD Hands-On Tutorial (2018) ☆29 Updated last year
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark ☆14 Updated 8 years ago
- Library for Geo-Inferencing in Twitter Data ☆28 Updated 8 years ago
- Machine learning evaluation database ☆24 Updated 6 years ago
- A simple example of containerized data science with Python and Docker. ☆51 Updated 6 years ago
- A Machine Learning System for Data Enrichment. ☆75 Updated 6 years ago
- Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit ☆21 Updated 6 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation ☆43 Updated 8 years ago
- Using Word2Vec on lists and sets ☆34 Updated 9 years ago
- ☆26 Updated 7 years ago
- CrowdRec reference framework ☆32 Updated 7 years ago
- Python application to set up and run streaming (contextual) bandit experiments. ☆79 Updated last year
- In-database parallel grid-search for XGBoost on Greenplum ☆15 Updated 6 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance. ☆40 Updated 8 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility. ☆101 Updated 7 years ago
- Datasets and notebooks ☆13 Updated 8 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers. ☆30 Updated 9 years ago
- Regularized latent variable mixed membership modeling ☆13 Updated 11 years ago
- NLP tutorial for the Berlin Data Science Retreat ☆41 Updated 8 years ago
- Using Genetic Algorithms to aid Machine Learning ☆18 Updated 6 years ago
- A startup search engine made using embeddings built on Crunchbase company descriptions ☆11 Updated 8 years ago
- Machine Learning Versioning made Simple ☆38 Updated 2 years ago
- Dynamic Topic Model (based upon code released by David Blei at http://www.cs.princeton.edu/~blei/topicmodeling.html) ☆31 Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- Code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 Updated 8 years ago
- ☆26 Updated 8 years ago
- Kaggle competition results ☆20 Updated 5 years ago