Snape is a convenient artificial dataset generator that wraps sklearn's make_classification and make_regression and then adds 'realism' features such as complex formatting, varying scales, categorical variables, and missing values.
☆166May 20, 2020Updated 5 years ago
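The idea in the description can be illustrated with a minimal sketch. This is not snape's own API: the helper name `make_messy_classification` and its parameters are hypothetical, and it assumes scikit-learn, NumPy, and pandas are installed. It starts from `make_classification` and then layers on varying scales, one categorical column, and missing values.

```python
import numpy as np
import pandas as pd
from sklearn.datasets import make_classification


def make_messy_classification(n_samples=200, n_features=5, missing_rate=0.1, seed=0):
    """Hypothetical helper (not snape's API): clean synthetic data plus 'realism'."""
    rng = np.random.default_rng(seed)

    # Start from a clean synthetic classification problem.
    X, y = make_classification(
        n_samples=n_samples,
        n_features=n_features,
        n_informative=3,
        n_redundant=0,
        random_state=seed,
    )
    columns = [f"x{i}" for i in range(n_features)]
    df = pd.DataFrame(X, columns=columns)

    # Varying scales: multiply each column by a different power of ten.
    scales = 10.0 ** rng.integers(-2, 4, size=n_features)
    df = df * scales

    # Categorical variable: bin the last numeric column into three labeled levels.
    last = columns[-1]
    df[last] = pd.qcut(df[last], q=3, labels=["low", "mid", "high"]).astype(object)

    # Missing values: blank out roughly `missing_rate` of the feature cells.
    mask = rng.random(df.shape) < missing_rate
    df = df.mask(mask)

    # Attach the target last so it is never masked.
    df["y"] = y
    return df


if __name__ == "__main__":
    print(make_messy_classification().head())
```

The target column is attached after masking so that only features receive missing values; a regression variant could follow the same pattern with `make_regression`.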
Alternatives and similar repositories for snape
Users interested in snape are comparing it to the repositories listed below
- ☆10Nov 19, 2015Updated 10 years ago
- Documents for the project Libraccess☆13Jan 30, 2015Updated 11 years ago
- ☆84Mar 9, 2018Updated 7 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆174Nov 3, 2016Updated 9 years ago
- A toolkit for generating paraphrase vector representations for words in context☆23May 19, 2015Updated 10 years ago
- ☆16Dec 14, 2015Updated 10 years ago
- Data and regressions on Premier League teams from 2000-01 through to 2016-17☆11Jul 31, 2017Updated 8 years ago
- Bayesian Regression Models using pymc3☆11Feb 4, 2017Updated 9 years ago
- Price options by fitting a Lévy distribution☆10Jan 20, 2021Updated 5 years ago
- A project to translate the Voynich Manuscript into English☆11Jun 30, 2023Updated 2 years ago
- Notebooks on fitting mixed-effects models in Julia☆24Nov 3, 2017Updated 8 years ago
- Online Interpretable Word Embeddings☆37Nov 17, 2015Updated 10 years ago
- ☆12May 25, 2018Updated 7 years ago
- Hugo Awards nominating and voting☆15Updated this week
- Repository for the Health Search Tutorial☆12Aug 27, 2018Updated 7 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Amino-Acid Sequence Annotation Predictor (ASAP)☆25May 13, 2020Updated 5 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Jun 3, 2014Updated 11 years ago
- Natural language processing using unsupervised vectors representation.☆105Jan 24, 2020Updated 6 years ago
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆489Aug 11, 2017Updated 8 years ago
- 150,000 tweets from 2016's second presidential debate between Hillary Clinton and Donald Trump☆11Oct 10, 2016Updated 9 years ago
- ☆13Aug 17, 2017Updated 8 years ago
- 📽s from "That's not [data] science!"☆14Sep 4, 2018Updated 7 years ago
- code for Seattle Twitter-Dev Meetup, October 2016☆13Oct 26, 2016Updated 9 years ago
- Tutorial on multilevel modeling, using Gelman radon example☆58Jul 23, 2015Updated 10 years ago
- Efficient distributed hyperparameter search library written in Python.☆74Aug 14, 2018Updated 7 years ago
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆467Feb 27, 2025Updated last year
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- sciblox - Easier Data Science and Machine Learning☆51Jul 28, 2017Updated 8 years ago
- A domain-general, Bayesian method for analyzing high-dimensional data tables☆328Feb 22, 2024Updated 2 years ago
- A new kind of pooling layer for faster and sharper convergence☆76Oct 1, 2017Updated 8 years ago
- ☆190Jul 6, 2023Updated 2 years ago
- Companion code for my video course on Practical Python Data Science Techniques, published by Packt Publishing☆34Sep 14, 2017Updated 8 years ago
- Quorum DevOps☆16Jan 23, 2022Updated 4 years ago
- ☆12Jul 9, 2017Updated 8 years ago
- ☆15Jul 26, 2019Updated 6 years ago
- Slides and materials for most of my talks by year☆93Sep 14, 2023Updated 2 years ago
- Notebook comparing scikit-learn and Spark ML for building Machine Learning Pipelines☆13Oct 8, 2015Updated 10 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Jun 23, 2014Updated 11 years ago