glassflow / clickhouse-etl
Real-time deduplication and temporal joins for streaming data
☆27Updated this week
Alternatives and similar repositories for clickhouse-etl:
Users that are interested in clickhouse-etl are comparing it to the libraries listed below
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆26Updated 2 weeks ago
- Python package for querying iceberg data through duckdb.☆65Updated last year
- Docker envinroment to stream data from Kafka to Iceberg tables☆28Updated last year
- Snowflake bring-your-own-cloud option. Run Snowflake as a microservice on your own compute☆57Updated this week
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆51Updated 3 weeks ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆26Updated 8 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆61Updated 2 years ago
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆53Updated 2 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Boiling Insights - From raw S3 data to charts in seconds☆18Updated 4 months ago
- A Rust based data/CSV/Parquet file generator☆52Updated 2 months ago
- A Python Client for Hive Metastore☆12Updated last year
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆28Updated last month
- Yet Another (Spark) ETL Framework☆21Updated last year
- Apache Hive Metastore in Standalone Mode With Docker☆13Updated 9 months ago
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆34Updated last year
- Unity Catalog UI☆40Updated 8 months ago
- DuckDB WebMacro: Share and Load your SQL Macros via gists☆12Updated 4 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- dbt (data build tool) adapter for the Dremio☆51Updated this week
- DuckDB Pyroscope Extension for Continuous Profiling☆17Updated last month
- Tutorials, templates for running glassflow pipelines☆30Updated 2 months ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆73Updated 2 months ago
- ☆34Updated 3 weeks ago
- Lambda function to serverlessly repartition parquet files in S3☆35Updated last month
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated last year
- Mock streaming data generator☆17Updated 11 months ago
- MCP Server for Trino developed via MCP Python SDK☆15Updated last week
- Use dbt to manage real-time data transformations in RisingWave.☆25Updated last month
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 8 months ago