awslabs / dqdlLinks
☆22Updated 2 months ago
Alternatives and similar repositories for dqdl
Users that are interested in dqdl are comparing it to the libraries listed below
Sorting:
- ☆40Updated 3 weeks ago
- A leightweight UI for Lakekeeper☆16Updated this week
- Multi-hop declarative data pipelines☆122Updated this week
- Experimental version. A BYOC option for Snowflake workloads☆100Updated this week
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆65Updated last month
- Spark Accelerator framework ; It enables secondary indices to remote data stores.☆39Updated 3 weeks ago
- Apache flink☆23Updated 2 weeks ago
- 🗃 Automate periodic data operations, such as deleting indices at a certain age or performing a rollover at a certain size☆72Updated this week
- Generated Kafka protocol implementations☆34Updated last week
- Resilient data pipeline framework running on Apache Spark☆25Updated last week
- Amundsen Gremlin☆21Updated 3 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆29Updated 2 weeks ago
- Paper: A Zero-rename committer for object stores☆20Updated 2 months ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆65Updated 3 weeks ago
- Java bindings for the Cedar language☆65Updated last week
- A tool to benchmark L (loading) workloads within ETL workloads☆30Updated this week
- Apache iceberg Spark s3 examples☆20Updated last year
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆145Updated 4 months ago
- ☆22Updated last year
- Compaction runtime for Apache Iceberg.☆113Updated this week
- Lightweight storage for Trino views☆16Updated 3 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- 📈 Get detailed performance metrics from your cluster independently of the Java Virtual Machine (JVM)☆46Updated 2 weeks ago
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO)☆39Updated 3 weeks ago
- ☆32Updated last month
- Extensible streaming ingestion pipeline on top of Apache Spark☆45Updated 5 months ago
- A one-afternoon implementation of redis-like primitives with S3 Express☆33Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust☆85Updated last year
- Apache datasketches☆102Updated 3 weeks ago
- The Performance Analyzer RCA is a framework that builds on the Performance Analyzer engine to support root cause analysis (RCA) of perfor…☆34Updated 2 months ago