Tutorial material on machine learning with dirty data in Python
☆61 · Jul 7, 2024 · Updated last year
Alternatives and similar repositories for python
Users that are interested in python are comparing it to the libraries listed below.
- This is the implementation of the Recursive Nearest (Neighbor) Agglomeration ☆11 · Oct 9, 2020 · Updated 5 years ago
- Package to estimate the grouping loss of a classifier, based on the paper "Beyond calibration: estimating the grouping loss of modern neu… ☆11 · Dec 14, 2024 · Updated last year
- data⎰describe: Pythonic EDA Accelerator for Data Science ☆301 · Feb 22, 2023 · Updated 3 years ago
- Machine learning with dataframes ☆1,589 · Updated this week
- ☆55 · Mar 24, 2026 · Updated 2 weeks ago
- Simulations for predictive model selection in causal inference ☆13 · Jan 16, 2025 · Updated last year
- Automated machine learning: Review of the state-of-the-art and opportunities for healthcare ☆41 · Oct 9, 2020 · Updated 5 years ago
- Similarity encoding of dirty categorical variables (strings) ☆20 · Jan 22, 2019 · Updated 7 years ago
- ☆12 · Jul 28, 2020 · Updated 5 years ago
- Introduction to Dask for PyTorch Workflows ☆13 · Mar 3, 2021 · Updated 5 years ago
- Lightweight Python wrapper around the DuckDB extension, httpserver (extension developed by @quackscience) ☆17 · Sep 24, 2025 · Updated 6 months ago
- Official repository for Characterization of tumor heterogeneity through segmentation-free representation learning on multiplexed imaging … ☆15 · Sep 28, 2025 · Updated 6 months ago
- ☆14 · Sep 8, 2023 · Updated 2 years ago
- ☆18 · Feb 28, 2022 · Updated 4 years ago
- This web crawler can be customized to scrape almost all types of websites. ☆11 · Dec 31, 2021 · Updated 4 years ago
- Python/PyMC3 port of the examples in "Statistical Rethinking: A Bayesian Course with Examples in R and Stan" by Richard McElreath ☆19 · Oct 28, 2017 · Updated 8 years ago
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment ☆11 · Dec 17, 2025 · Updated 3 months ago
- ☆37 · Mar 20, 2020 · Updated 6 years ago
- A Mermaid widget for interactively exploring Mermaid diagrams in notebooks and Panel data apps ☆12 · Oct 25, 2024 · Updated last year
- ✨An MCP server that provides intelligent access to the HoloViz ecosystem for humans and AIs. ☆30 · Apr 1, 2026 · Updated last week
- Microbenchmark testing Python, Numba, Mojo, Dart, C/gcc, Rust, Go, JavaScript, C#, Java, Kotlin, Pascal, Ruby, Haskell performance in Man… ☆15 · Mar 26, 2025 · Updated last year
- Two LogisticRegression implementations usable in an sklearn pipeline, based on statsmodels and scikit-learn respectively, each producing a corresponding report ☆21 · May 21, 2023 · Updated 2 years ago
- simplify geographic shapes ☆12 · Sep 21, 2015 · Updated 10 years ago
- ☆23 · Jan 27, 2022 · Updated 4 years ago
- Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model ☆14 · Aug 15, 2023 · Updated 2 years ago
- ETNA – Time-Series Library ☆885 · Aug 9, 2023 · Updated 2 years ago
- Python address parsing / area code lookup / postal code lookup ☆12 · Oct 8, 2021 · Updated 4 years ago
- An updated R package deeper: Deep Ensemble for Environmental Predictor ☆14 · Mar 15, 2022 · Updated 4 years ago
- An open-source NLP library: fast text cleaning and preprocessing ☆23 · Nov 9, 2021 · Updated 4 years ago
- ☆12 · Mar 15, 2023 · Updated 3 years ago
- KEN: Relational Data Embeddings ☆34 · Jan 2, 2024 · Updated 2 years ago
- My blog ☆11 · Nov 17, 2020 · Updated 5 years ago
- generic tools and pipelines that support utilization of WaPOR data as part of the Water Productivity in Practice project (WaterPIP) ☆12 · Oct 31, 2024 · Updated last year
- Codes for paper "KNAS: Green Neural Architecture Search" ☆93 · Nov 18, 2021 · Updated 4 years ago
- Master repository for the pandas-ml modules ☆163 · Jul 23, 2023 · Updated 2 years ago
- Unofficial instructions for changing Python kernel version on Google Colab. ☆25 · Apr 21, 2025 · Updated 11 months ago
- ☆27 · Sep 9, 2021 · Updated 4 years ago
- Spark ML Tutorial and Examples for Beginners ☆18 · Mar 26, 2018 · Updated 8 years ago
- 2020 Summer Olympics medals per million people ☆12 · Aug 8, 2021 · Updated 4 years ago