avriiil / stream-this-datasetLinks
Code to convert static datasets into simulated data streams
☆14Updated 2 years ago
Alternatives and similar repositories for stream-this-dataset
Users that are interested in stream-this-dataset are comparing it to the libraries listed below
Sorting:
- Supercharge BigQuery with BigFunctions☆748Updated 3 months ago
- ☆49Updated last year
- Airbyte made simple (no UI, no database, no cluster)☆183Updated 3 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated last month
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆185Updated 2 years ago
- Data product portal created by Dataminded☆190Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Airbyte deployment and configuration management tool☆12Updated 3 years ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆375Updated 3 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Demo of Streamlit application with Databricks SQL Endpoint☆34Updated 2 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆459Updated this week
- ☆135Updated last week
- Example repository showing how to build a data platform with Prefect, dbt and Snowflake☆104Updated 2 years ago
- This package contains macros and models to find DAG issues automatically☆500Updated 3 weeks ago
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces…☆477Updated 5 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆202Updated this week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆260Updated last month
- Food for thoughts around data contracts☆26Updated last month
- ☆155Updated last month
- PyAirbyte brings the power of Airbyte to every Python developer.☆293Updated this week
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆124Updated 7 months ago
- ☆119Updated last month
- A web extension to empower dbt users☆27Updated 3 years ago
- One-stop-shop for docs and test coverage of dbt projects.☆221Updated 3 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆148Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆115Updated 5 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆131Updated 3 years ago
- ☆183Updated last month