A curated list of awesome posts, videos, and articles on leading a data team (small and large)
☆548Jan 5, 2026Updated 2 months ago
Alternatives and similar repositories for awesome-data-leadership
Users that are interested in awesome-data-leadership are comparing it to the libraries listed below
Sorting:
- Resources for Survival Analysis☆99Jul 3, 2025Updated 8 months ago
- Collection of dbt Tips and Tricks☆399Oct 12, 2022Updated 3 years ago
- Code from my blog series: R Tips & Tricks☆15Feb 1, 2024Updated 2 years ago
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line…☆669Feb 22, 2025Updated last year
- A python package with tools to perform causal inference using observational data when the treatment of interest is continuous.☆281Jan 5, 2026Updated 2 months ago
- An open-source ML pipeline development platform☆998Jan 9, 2025Updated last year
- A curated list of references for MLOps☆13,789Nov 21, 2024Updated last year
- A helper tool for dbt development and data warehouse management.☆118Oct 11, 2020Updated 5 years ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,623May 29, 2025Updated 9 months ago
- Machine Learning Notebooks☆3,436Apr 9, 2024Updated last year
- ☆384Jan 24, 2024Updated 2 years ago
- Toolkit for developing and maintaining ML models☆151Jun 6, 2024Updated last year
- nannyml: post-deployment data science in python☆2,127Jul 12, 2025Updated 7 months ago
- 📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.☆28,706Jul 18, 2024Updated last year
- Always know what to expect from your data.☆11,224Updated this week
- ☆87Sep 26, 2025Updated 5 months ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,159Mar 1, 2026Updated last week
- ☆17Jul 26, 2022Updated 3 years ago
- An Opinionated Quarto Book on being a Data Director in Democratic Politics☆14Feb 19, 2023Updated 3 years ago
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning☆20,228Updated this week
- A repository of all of the materials shared throughout the NHS-R conference 2022☆19Nov 29, 2022Updated 3 years ago
- Notebook seen in Jeremy Howard's keynote at posit::conf(2023)☆19Sep 21, 2023Updated 2 years ago
- 🐳 The stupidly simple CLI workspace for your data warehouse.☆728Feb 8, 2023Updated 3 years ago
- 📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)☆647Mar 16, 2023Updated 2 years ago
- Pre-alpha {targets} workshop☆36Mar 3, 2026Updated last week
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,708Mar 4, 2026Updated last week
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,502Mar 3, 2026Updated last week
- Doubt your data, find bad labels.☆517Jul 15, 2024Updated last year
- Compare tables within or across databases☆2,991May 17, 2024Updated last year
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆862Jul 3, 2023Updated 2 years ago
- Repository for the ActivitySchema spec and supporting materials☆438Dec 20, 2022Updated 3 years ago
- ☆20Aug 23, 2025Updated 6 months ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,781Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,569Apr 30, 2024Updated last year
- Materials for ShinyConf 2023 talk: Debugging With Shiny☆13May 10, 2023Updated 2 years ago
- An introduction to GAM(M)s☆10Dec 14, 2022Updated 3 years ago
- The 'ggcorset' package - Introducing corset plots!☆34Sep 6, 2024Updated last year
- Data Contracts engine for the modern data stack. https://www.soda.io☆2,303Updated this week
- Write python locally, execute SQL in your data warehouse☆268Jul 5, 2022Updated 3 years ago