natbusa / datafaucetLinks
Productivity Utilities for Data Science with Python Notebooks
☆6Updated 5 years ago
Alternatives and similar repositories for datafaucet
Users that are interested in datafaucet are comparing it to the libraries listed below
Sorting:
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- Know your ML Score based on Sculley's paper☆34Updated 6 years ago
- ☆19Updated 4 years ago
- ☆15Updated 6 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- ☆12Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆100Updated last week
- Best practices for engineering ML pipelines.☆35Updated 3 years ago
- ☆30Updated last year
- Project template for highly effective data science workflows☆29Updated last year
- A bit of extra usability for sqlalchemy v2.☆77Updated last year
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 3 years ago
- Study notes and demos.☆12Updated last year
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 3 years ago
- Public repository for versioning machine learning data☆42Updated 3 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- PySpark phonetic and string matching algorithms☆39Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 6 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 9 months ago
- Record matching and entity resolution at scale in Spark☆35Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Updated 6 years ago
- Workshop on Target Leakage in Machine Learning I taught at ODSC Europe 2018 (London) and ODSC East 2019, 2020 (Boston)☆37Updated 5 years ago
- Tutorials & articles on Python, leetcode problems, pandas, and more.☆26Updated 2 years ago
- In-class exercises for Deep Learning course at NYC Data Science Academy☆32Updated 7 years ago
- Data Scientist code test☆19Updated 5 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆40Updated 2 years ago