avriiil / stream-this-dataset
Code to convert static datasets into simulated data streams
☆14 Updated 2 years ago
Alternatives and similar repositories for stream-this-dataset
Users that are interested in stream-this-dataset are comparing it to the libraries listed below
- ☆49 Updated last year
- Airbyte made simple (no UI, no database, no cluster) ☆174 Updated last month
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆115 Updated 3 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆218 Updated 2 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆195 Updated last week
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros. ☆185 Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆86 Updated 2 years ago
- Food for thoughts around data contracts ☆26 Updated 4 months ago
- A dbt-core plugin to weave together multi-project dbt-core deployments ☆159 Updated last week
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆70 Updated 9 months ago
- LLM based AI Agent to automate Data Analysis for dbt projects ☆115 Updated last week
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆143 Updated last year
- Creates simple data models on Snowflake to report dbt source freshness and tests ☆26 Updated 2 years ago
- Data product portal created by Dataminded ☆186 Updated this week
- ☆42 Updated 3 years ago
- PyAirbyte brings the power of Airbyte to every Python developer. ☆277 Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆218 Updated last month
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts ☆366 Updated last week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ☆372 Updated last month
- Showcase of advanced use cases relating to CI in dbt ☆83 Updated this week
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions. ☆122 Updated 5 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner ☆71 Updated 3 years ago
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow. ☆202 Updated 3 weeks ago
- Template for a data contract used in a data mesh. ☆471 Updated last year
- prefect integration for running dbt ☆62 Updated 10 months ago
- A dbt package from SELECT to help you monitor Snowflake performance and costs ☆242 Updated 2 weeks ago
- ☆132 Updated 11 months ago
- Project for "Data pipeline design patterns" blog. ☆45 Updated 11 months ago
- ☆140 Updated 7 months ago
- Delta Lake helper methods in PySpark ☆324 Updated 10 months ago