A guide for leading a data (engineering) team
☆64May 7, 2024Updated last year
Alternatives and similar repositories for run-a-data-team
Users that are interested in run-a-data-team are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Template for a DuckDB-based, Codespace-oriented sandbox project that is also dbt Cloud compatible, and includes code-first BI tooling via…☆17Apr 7, 2023Updated 3 years ago
- A Singer tap that wraps Airbyte sources allowing them to be consumed by Singer targets☆26Mar 20, 2025Updated last year
- Matatika Community Edition☆27Updated this week
- A cool simple example of functional data engineering☆35Mar 13, 2023Updated 3 years ago
- Delta lake and filesystem helper methods☆50Feb 29, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A compilation of main commands for scikit-learn with examples☆11Apr 4, 2023Updated 3 years ago
- Assets related to the operation of Fishtown Analytics.☆421Oct 18, 2024Updated last year
- Houston orchestration API. callhouston.io☆51Jun 16, 2025Updated 9 months ago
- ☆46Jun 10, 2024Updated last year
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com☆61Apr 1, 2026Updated last week
- ☆12Aug 8, 2023Updated 2 years ago
- Speaker slides and materials for SatRday Berlin 2019☆11Sep 4, 2020Updated 5 years ago
- ☆36Mar 2, 2026Updated last month
- Singer Tap for PostgreSQL☆26Apr 1, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆15Sep 30, 2022Updated 3 years ago
- ☆18Apr 25, 2018Updated 7 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Content published on social channels☆17Apr 5, 2025Updated last year
- learning-by-doing data model built with dbt-core☆17Mar 9, 2026Updated last month
- PHP Inhound Mail parser for different clients (Sendgrid, Gmail, Postmark etc)☆20Feb 18, 2020Updated 6 years ago
- Wrap pynamo models in pydantic schemas to make them easy to work with in FastAPI☆18May 13, 2024Updated last year
- dbt module for myBI connect☆13Jan 31, 2023Updated 3 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables☆93Dec 22, 2025Updated 3 months ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Multi-threaded simple proxy server in Python with file caching☆11Oct 4, 2020Updated 5 years ago
- The binary build of LEO CDP Free Edition for training purposes☆53Oct 13, 2025Updated 5 months ago
- Managing Data as a Product, published by Packt☆20Nov 30, 2024Updated last year
- ☆12Jan 10, 2023Updated 3 years ago
- The Data Contract Specification Repository☆413Dec 8, 2025Updated 4 months ago
- Materials from the Data Science with Spark and R☆21Nov 15, 2018Updated 7 years ago
- A web extension to empower dbt users☆27Aug 10, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆16Jun 19, 2022Updated 3 years ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆45Jan 24, 2026Updated 2 months ago
- Trino Iceberg Metadata Insights via Streamlit☆16Apr 9, 2025Updated last year
- ☆11Nov 21, 2023Updated 2 years ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago
- ☆19Oct 16, 2020Updated 5 years ago
- Cost Efficient Data Pipelines with DuckDB☆63May 14, 2025Updated 10 months ago