ronikobrosly / awesome-data-leadership
A curated list of awesome posts, videos, and articles on leading a data team (small and large)
☆521Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-data-leadership
- Readings for Analytics Engineers☆228Updated last year
- Collection of dbt Tips and Tricks☆369Updated 2 years ago
- Python API for Deequ☆727Updated 3 weeks ago
- This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring…☆1,050Updated last month
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆495Updated last month
- The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process …☆487Updated 3 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,903Updated this week
- A tutorial for setting a new machine with core data science tools☆263Updated last year
- A list of blogs, videos, and other content that provides advice on building experimentation and A/B testing platforms☆146Updated last year
- Clean Code concepts adapted for machine learning and data science. Now a free video series 😎 https://bit.ly/2yGDyqT☆713Updated 2 years ago
- 📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)☆552Updated last year
- A curated list of awesome dbt resources☆1,172Updated 3 weeks ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆957Updated this week
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆845Updated last year
- Collection of articles listing reasons why data science projects fail.☆457Updated 3 years ago
- A Data Engineering & Machine Learning Knowledge Hub☆1,116Updated 9 months ago
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- ☆264Updated last month
- Accumulated knowledge and experience in the field of Data Engineering☆851Updated last year
- Assets related to the operation of Fishtown Analytics.☆415Updated 3 weeks ago
- ☆444Updated last month
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆174Updated 4 months ago
- A curated collection of helpful SQL queries and functions, maintained by Count.☆201Updated 2 years ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆179Updated last year
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆863Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆166Updated last year
- ☆344Updated 9 months ago
- Port(ish) of Great Expectations to dbt test macros☆1,077Updated 2 months ago
- PySpark test helper methods with beautiful error messages☆616Updated 2 weeks ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆383Updated 2 years ago