ronikobrosly / awesome-data-leadership
A curated list of awesome posts, videos, and articles on leading a data team (small and large)
☆526 · Updated last year
Alternatives and similar repositories for awesome-data-leadership:
Users who are interested in awesome-data-leadership are comparing it to the repositories listed below.
- Readings for Analytics Engineers ☆239 · Updated 2 years ago
- A list of blogs, videos, and other content that provides advice on building experimentation and A/B testing platforms ☆155 · Updated last year
- Collection of dbt Tips and Tricks ☆379 · Updated 2 years ago
- Data pipeline with dbt, Airflow, Great Expectations ☆160 · Updated 3 years ago
- An end-to-end implementation of intent prediction with Metaflow and other cool tools ☆856 · Updated last year
- 📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.) ☆577 · Updated last year
- Repository for the ActivitySchema spec and supporting materials ☆410 · Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆166 · Updated last year
- A curated collection of helpful SQL queries and functions, maintained by Count. ☆201 · Updated 3 years ago
- ☆357 · Updated last year
- A Data Engineering & Machine Learning Knowledge Hub ☆1,119 · Updated last year
- A curated list of awesome dbt resources ☆1,277 · Updated 3 weeks ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow. ☆179 · Updated 7 months ago
- The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process … ☆498 · Updated 3 years ago
- The Fuzzy Labs guide to the universe of open source MLOps ☆455 · Updated 7 months ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,016 · Updated this week
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton ☆861 · Updated last year
- A library to find and visualise the most interesting slices in multidimensional data ☆106 · Updated 3 weeks ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML ☆980 · Updated this week
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner ☆71 · Updated 3 years ago
- ☆664 · Updated 4 months ago
- Code from the book Fighting Churn With Data ☆282 · Updated 3 weeks ago
- This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring… ☆1,103 · Updated 5 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎ ☆498 · Updated 3 weeks ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros. ☆184 · Updated last year
- Learn by doing: DIY project groups at DataTalks.Club ☆396 · Updated 8 months ago
- Style guides and conventions ☆162 · Updated last year
- A list of awesome data podcasts ☆374 · Updated last year
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase ☆124 · Updated 2 years ago
- A command line tool to easily add an ethics checklist to your data science projects. ☆291 · Updated 7 months ago