avriiil / stream-this-dataset
Code to convert static datasets into simulated data streams
☆15 · Updated 2 years ago
Alternatives and similar repositories for stream-this-dataset
Users who are interested in stream-this-dataset are comparing it to the libraries listed below.
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆222 · Updated last week
- Airbyte made simple (no UI, no database, no cluster) ☆192 · Updated 5 months ago
- Data Product Portal created by Dataminded ☆195 · Updated last week
- PyAirbyte brings the power of Airbyte to every Python developer. ☆306 · Updated last week
- Home of the Open Data Contract Standard (ODCS). ☆592 · Updated this week
- PySpark test helper methods with beautiful error messages ☆730 · Updated 2 months ago
- ☆40 · Updated 7 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆45 · Updated last year
- Template for a data contract used in a data mesh. ☆484 · Updated last year
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ☆375 · Updated 6 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆256 · Updated last month
- ☆48 · Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆223 · Updated 6 months ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆180 · Updated last year
- Dagster Labs' open-source data platform, built with Dagster. ☆418 · Updated last week
- Delta Lake helper methods in PySpark ☆324 · Updated last year
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer ☆274 · Updated 4 months ago
- ☆212 · Updated 10 months ago
- ☆120 · Updated 4 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆91 · Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆227 · Updated last month
- Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate ☆116 · Updated 2 years ago
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces… ☆479 · Updated 8 months ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team … ☆129 · Updated 3 weeks ago
- Sample configuration to deploy a modern data platform. ☆88 · Updated 3 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆77 · Updated this week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆274 · Updated last month
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆154 · Updated last year
- ☆42 · Updated 4 years ago
- Dagster University courses ☆116 · Updated last week