A guide for leading a data (engineering) team
☆65May 7, 2024Updated 2 years ago
Alternatives and similar repositories for run-a-data-team
Users that are interested in run-a-data-team are comparing it to the libraries listed below.
- ☆11Sep 30, 2022Updated 3 years ago
- Template for a DuckDB-based, Codespace-oriented sandbox project that is also dbt Cloud compatible, and includes code-first BI tooling via…☆17Apr 7, 2023Updated 3 years ago
- Miscellaneous files for cfbscrapR (also includes work for EPA/WPA models)☆10Oct 7, 2020Updated 5 years ago
- data-mesh-demo☆13Apr 12, 2022Updated 4 years ago
- A Singer tap that wraps Airbyte sources allowing them to be consumed by Singer targets☆26Mar 20, 2025Updated last year
- Matatika Community Edition☆28Jun 19, 2026Updated last week
- A cool simple example of functional data engineering☆35Mar 13, 2023Updated 3 years ago
- An implementation template for a Kimball (dimensional) Data Warehouse☆43Nov 6, 2018Updated 7 years ago
- ☆46Jun 10, 2024Updated 2 years ago
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com☆63Jun 19, 2026Updated last week
- ☆12Aug 8, 2023Updated 2 years ago
- ☆40Mar 2, 2026Updated 3 months ago
- Singer Tap for PostgreSQL☆26Updated this week
- Where the Meltano team runs Meltano! Get it???☆31Apr 9, 2025Updated last year
- Have your first meltano project running within 5 minutes - no setup - no install - no boundaries. All inside GitHub Codespaces. (GitHub a…☆56Apr 7, 2025Updated last year
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- learning-by-doing data model built with dbt-core☆17Apr 10, 2026Updated 2 months ago
- Content published on social channels☆17Apr 5, 2025Updated last year
- PHP Inbound Mail parser for different clients (Sendgrid, Gmail, Postmark etc)☆20Feb 18, 2020Updated 6 years ago
- Wrap pynamo models in pydantic schemas to make them easy to work with in FastAPI☆18May 13, 2024Updated 2 years ago
- dbt module for myBI connect☆13Jan 31, 2023Updated 3 years ago
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables☆94Jun 10, 2026Updated 2 weeks ago
- Demo of orchestrating Airbyte connections with Prefect☆11Mar 3, 2022Updated 4 years ago
- An incubating Debezium CDC connector for IBM i (AS/400). Please log issues at https://github.com/debezium/dbz/issues.☆20Updated this week
- ☆11Mar 2, 2026Updated 3 months ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Multi-threaded simple proxy server in Python with file caching☆11Oct 4, 2020Updated 5 years ago
- Collection of shell/Bash scripts for various use cases | #SE☆11Jun 8, 2026Updated 3 weeks ago
- ☆12Jan 10, 2023Updated 3 years ago
- Managing Data as a Product, published by Packt☆23Nov 30, 2024Updated last year
- The Data Contract Specification Repository☆416Dec 8, 2025Updated 6 months ago
- Materials from the Data Science with Spark and R☆21Nov 15, 2018Updated 7 years ago
- A web extension to empower dbt users☆27Aug 10, 2022Updated 3 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆18Jun 19, 2022Updated 4 years ago
- [REMIX FORK] My Personal One-Click-App Collection for CapRover☆16Nov 21, 2021Updated 4 years ago
- ☆11Nov 21, 2023Updated 2 years ago
- Trino Iceberg Metadata Insights via Streamlit☆17Apr 9, 2025Updated last year
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆45Jan 24, 2026Updated 5 months ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆70May 4, 2026Updated last month