hxchua / datadoubleconfirm
Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: https://projectosyo.wixsite.com/datadoubleconfirm.
☆59Updated 8 months ago
Alternatives and similar repositories for datadoubleconfirm
Users that are interested in datadoubleconfirm are comparing it to the libraries listed below
Sorting:
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data☆79Updated 3 years ago
- Production Machine Learning Pipeline for Text Classification with fastText☆32Updated 3 years ago
- Start your journey into social media analysis of politicans by using Python (Tutorial)☆21Updated 6 years ago
- Examples of unfairness detection for a classification-based credit model☆19Updated 5 years ago
- Sentiment Analysis of COVID-19 Vaccine-related Twitter Data☆10Updated 3 years ago
- Predicting Employee Churn with Supervised Machine Learning☆65Updated 4 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆52Updated 4 years ago
- Pytest for Data Science Beginners☆58Updated 6 years ago
- Productivity Utilities for Data Science with Python Notebooks☆6Updated 5 years ago
- DSSG Workshops☆50Updated 6 years ago
- Python class to perform AB test analysis☆13Updated 3 years ago
- Data Science Portfolio☆72Updated 4 years ago
- Library of automation tools for EDA and modeling☆27Updated 4 years ago
- Data Science Festival Workshop 7 November 2020 – Building a fashion recommender using Tensorflow/Keras with ASOS.☆26Updated 4 years ago
- ☆24Updated 3 years ago
- Best practices for engineering ML pipelines.☆35Updated 2 years ago
- Contains all tutorials and hands-on examples for the ODSC 2019 Workshop☆38Updated 5 years ago
- An end-to-end tutorial to forecast the M5 dataset using feature engineering pipelines and gradient boosting.☆16Updated 2 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- Dash app for classifying tweets in real-time☆66Updated last year
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- I will be putting notebooks created for #100daysofnlp here☆19Updated 4 years ago
- Analysis for Customer Segmentation☆69Updated 4 years ago
- This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.☆57Updated 4 years ago
- A guide on creating and deploying your Streamlit application to Heroku☆49Updated 9 months ago
- ☆11Updated 4 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- Repository to maintain all my clutter☆75Updated 3 years ago
- How to be a Data Scientist☆34Updated 3 years ago