avriiil / stream-this-dataset
Code to convert static datasets into simulated data streams
☆15Updated 2 years ago
Alternatives and similar repositories for stream-this-dataset
Users who are interested in stream-this-dataset are comparing it to the libraries listed below
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆180Updated last year
- Data product portal created by Dataminded☆192Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated 2 weeks ago
- Playground for using large language models in the Modern Data Stack for entity matching☆108Updated 2 years ago
- PyAirbyte brings the power of Airbyte to every Python developer.☆303Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Airbyte made simple (no UI, no database, no cluster)☆184Updated 4 months ago
- Food for thought around data contracts☆26Updated 3 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆375Updated 5 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆223Updated 5 months ago
- Template for a data contract used in a data mesh.☆476Updated last year
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆223Updated this week
- ☆49Updated last year
- New generation opensource data stack☆74Updated 3 years ago
- Airbyte deployment and configuration management tool☆12Updated 3 years ago
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer☆272Updated 3 months ago
- ☆211Updated 9 months ago
- Code for dbt tutorial☆162Updated last month
- Code snippets for Data Engineering Design Patterns book☆232Updated 7 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆268Updated 2 weeks ago
- ☆160Updated 2 months ago
- Creates simple data models on Snowflake to report dbt source freshness and tests☆27Updated 2 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆117Updated 6 months ago
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces…☆478Updated 7 months ago
- Delta Lake helper methods. No Spark dependency.☆23Updated last year
- Supercharge BigQuery with BigFunctions☆751Updated last week
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆89Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆247Updated 2 weeks ago