Tutorial material on machine learning with dirty data in Python
☆61Jul 7, 2024Updated last year
Alternatives and similar repositories for python
Users that are interested in python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the implementation of the Recursive Nearest (Neighbor) Agglomeration☆11Oct 9, 2020Updated 5 years ago
- data⎰describe: Pythonic EDA Accelerator for Data Science☆302Feb 22, 2023Updated 3 years ago
- Machine learning with dataframes☆1,612May 12, 2026Updated last week
- Scripts for paper "Encoding high-cardinality string categorical variables"☆24Sep 11, 2019Updated 6 years ago
- Blog posts I've created about python, pandas, and related topics as a series of notebooks.☆23Apr 5, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Jul 28, 2020Updated 5 years ago
- Codes for "Part mutual information for quantifying direct associations in networks"☆10Sep 30, 2019Updated 6 years ago
- Introduction to Dask for PyTorch Workflows☆13Mar 3, 2021Updated 5 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago
- ☆14Sep 8, 2023Updated 2 years ago
- Official repository for Characterization of tumor heterogeneity through segmentation-free representation learning on multiplexed imaging …☆15Sep 28, 2025Updated 7 months ago
- CWGCNA is an R package to perform causal inference from the WGCNA framework.☆19Oct 26, 2024Updated last year
- Hierarchical neural implicit inference over event ensembles. Code repository associated with https://arxiv.org/abs/2306.12584.☆13Jun 24, 2023Updated 2 years ago
- This web crawler can be customized to scrape almost all types of websites.☆11Dec 31, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath☆19Oct 28, 2017Updated 8 years ago
- Analyse your own local files with ChatGPT style interaction☆14Apr 23, 2023Updated 3 years ago
- This is docker images of Ubuntu 16.04 LTS with different versions of java☆14Dec 8, 2021Updated 4 years ago
- A QGIS3 plugin to create a water network (sewer network, river network)☆15Apr 6, 2026Updated last month
- Microbenchmark testing Python, Numba, Mojo, Dart, C/gcc, Rust, Go, JavaScript, C#, Java, Kotlin, Pascal, Ruby, Haskell performance in Man…☆15Mar 26, 2025Updated last year
- simplify geographic shapes☆12Sep 21, 2015Updated 10 years ago
- Notes from our NLP reading club!☆18Jul 17, 2021Updated 4 years ago
- Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model☆14Aug 15, 2023Updated 2 years ago
- ETNA – Time-Series Library☆883Aug 9, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Repository with code samples for the LocalStack workshop☆19Apr 24, 2026Updated 3 weeks ago
- A lesson exploring the Julia language☆18Updated this week
- An open-source NLP library: fast text cleaning and preprocessing☆23Nov 9, 2021Updated 4 years ago
- KEN: Relational Data Embeddings☆35Jan 2, 2024Updated 2 years ago
- Master repository for the pandas-ml modules☆164Jul 23, 2023Updated 2 years ago
- Simple tool to change the INPUT and OUTPUT shape of ONNX.☆15Apr 1, 2025Updated last year
- Spark ML Tutorial and Examples for Beginners☆18Mar 26, 2018Updated 8 years ago
- Fast sequence vectorization for metagenomics analysis. Converts input sequences into oligonucleotide frequency vectors, fast!☆14May 12, 2024Updated 2 years ago
- 2020 Summer Olympics medals per million people☆12Aug 8, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- general functions for your data .pipe()-lines.☆17Nov 8, 2023Updated 2 years ago
- ☆17Mar 9, 2021Updated 5 years ago
- Reproducible Data Science with Python☆36Jan 10, 2023Updated 3 years ago
- A small Python library for one-sided tolerance bounds and two-sided tolerance intervals.☆17Mar 27, 2023Updated 3 years ago
- Python module for calculating Climate, Drought, Food Security, Irrigation, and Water Productivity indicators mostly based on the FAO's Wa…☆13Dec 14, 2020Updated 5 years ago
- a GitHub action to run `pre-commit` with `uv`☆20May 12, 2026Updated last week
- LaTeX source code for the slides☆24Jul 15, 2021Updated 4 years ago