datayoga-io / datayoga
streaming data pipeline platform
☆29 Updated last month
Alternatives and similar repositories for datayoga
Users who are interested in datayoga are comparing it to the libraries listed below.
- Generate beautiful documentation for your data pipelines in markdown format ☆28 Updated 4 years ago
- Python bindings for sqlparser-rs ☆201 Updated 8 months ago
- Run, mock and test fake Snowflake databases locally. ☆169 Updated last week
- Python wrapper for the Sling CLI tool ☆63 Updated last month
- Arrow Flight SQL Server ☆125 Updated 7 months ago
- A Postgres Proxy Server in Python ☆316 Updated last year
- ☆81 Updated 11 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆183 Updated last month
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆279 Updated last year
- 🚀 GizmoSQL — High-Performance SQL Server ☆289 Updated this week
- Read Apache Arrow batches from ODBC data sources in Python ☆74 Updated 3 weeks ago
- ☆116 Updated last year
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration. ☆38 Updated last week
- ☆70 Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆253 Updated 3 weeks ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe) ☆117 Updated 11 months ago
- Turning PySpark Into a Universal DataFrame API ☆485 Updated last week
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables ☆75 Updated 2 years ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa… ☆191 Updated last week
- ☆30 Updated last year
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ☆180 Updated this week
- GigAPI is a Timeseries lakehouse for real-time data and sub-second queries, powered by DuckDB OLAP + Parquet Query Engine, Compactor w/ C… ☆376 Updated 3 months ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆107 Updated last week
- Write your dbt models using Ibis ☆75 Updated 10 months ago
- Making DAG construction easier ☆283 Updated last month
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆332 Updated 2 years ago
- ☆376 Updated this week
- Write 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com ☆116 Updated this week
- Light-weight, browser-based ROLAP pivot tables on top of DuckDB-WASM ☆554 Updated 2 weeks ago
- ☆65 Updated 9 months ago