awslabs / dqdlLinks
☆17Updated last month
Alternatives and similar repositories for dqdl
Users that are interested in dqdl are comparing it to the libraries listed below
Sorting:
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Amundsen Gremlin☆21Updated 2 years ago
- ☆19Updated last month
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- ☆32Updated last year
- ☆37Updated 2 weeks ago
- Make your complete search architecture serverless by keeping the Lucene index in AWS S3 and search requests through AWS Lambda☆9Updated 7 years ago
- Amazon EMR on EKS Custom Image CLI☆31Updated 8 months ago
- Apache Hive Metastore in Standalone Mode With Docker☆13Updated 10 months ago
- Unity Catalog UI☆40Updated 8 months ago
- A Apache Hive SerDe (short for serializer/deserializer) for the Ion file format.☆31Updated 2 months ago
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated 3 weeks ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- ☆21Updated last year
- A Python Client for Hive Metastore☆12Updated last year
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆25Updated 6 months ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated 7 months ago
- Java bindings for the Cedar language☆58Updated 2 months ago
- ☆27Updated 2 months ago
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate☆37Updated last month
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆28Updated 3 weeks ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 9 months ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning☆22Updated 4 months ago
- A curated list of Apache Pulsar resources☆13Updated 6 years ago
- AWS Quick Start Team☆19Updated 8 months ago
- minio as local storage and DynamoDB as catalog☆15Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆40Updated this week
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated 2 years ago