awslabs / dqdlLinks
☆22Updated 2 months ago
Alternatives and similar repositories for dqdl
Users that are interested in dqdl are comparing it to the libraries listed below
Sorting:
- Amundsen Gremlin☆21Updated 3 years ago
- A leightweight UI for Lakekeeper☆16Updated last week
- Dynamic Conformance Engine☆32Updated 3 months ago
- Lightweight storage for Trino views☆17Updated last week
- Multi-hop declarative data pipelines☆124Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Updated 2 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆29Updated last month
- Generated Kafka protocol implementations☆34Updated 3 weeks ago
- Resilient data pipeline framework running on Apache Spark☆25Updated this week
- Spark Accelerator framework ; It enables secondary indices to remote data stores.☆39Updated this week
- A testing framework for Trino☆26Updated 10 months ago
- ☆40Updated 2 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆65Updated 2 months ago
- 🗃 Automate periodic data operations, such as deleting indices at a certain age or performing a rollover at a certain size☆73Updated this week
- MemQ is an efficient, scalable cloud native PubSub system☆140Updated 2 weeks ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆85Updated last year
- ☆32Updated last week
- Apache flink☆24Updated last month
- Extensible streaming ingestion pipeline on top of Apache Spark☆46Updated 6 months ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆105Updated last week
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- ☆22Updated last year
- A tool to benchmark L (loading) workloads within ETL workloads☆30Updated 3 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated 2 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Updated 2 years ago
- ☆18Updated 3 years ago
- Aerospike storage backend for Janusgraph☆32Updated last year
- ☆58Updated 3 weeks ago