ajithshetty / data-engineering-rust-demo
Rust And Delta Demo. Explanation and walkthrough on delta-rs
☆10Updated last year
Alternatives and similar repositories for data-engineering-rust-demo:
Users that are interested in data-engineering-rust-demo are comparing it to the libraries listed below
- dbt-databend adapter plugin☆10Updated 8 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆76Updated 4 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆24Updated 11 months ago
- Pythonic Iceberg REST Catalog☆72Updated 4 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆83Updated last month
- Apache Spark Connect Client for Rust☆96Updated 3 weeks ago
- ☆26Updated last month
- Repo for CDC with debezium blog post☆28Updated 4 months ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated 11 months ago
- Mock streaming data generator☆16Updated 8 months ago
- Cost Efficient Data Pipelines with DuckDB☆48Updated 6 months ago
- ☆15Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆90Updated 10 months ago
- ☆34Updated 10 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 7 months ago
- ☆36Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆48Updated 3 years ago
- The native Rust implementation for Apache Hudi, with Python API bindings.☆189Updated this week
- Utility functions for dbt projects running on Spark☆31Updated last week
- A custom end-to-end analytics platform for customer churn☆10Updated last week
- ☆12Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆23Updated 5 months ago
- Quick Guides from Dremio on Several topics☆67Updated 2 weeks ago
- ☆11Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆49Updated last year
- Apache DataFusion Ray☆151Updated last week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆155Updated 2 months ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆18Updated 9 months ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆19Updated 2 years ago