anna-anisienia / data-discovery-apiLinks
☆15Updated 4 years ago
Alternatives and similar repositories for data-discovery-api
Users that are interested in data-discovery-api are comparing it to the libraries listed below
Sorting:
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆18Updated last month
- Using the Parquet file format with Python☆15Updated last year
- Simple samples for writing ETL transform scripts in Python☆23Updated 3 years ago
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆13Updated 6 years ago
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆19Updated 3 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆22Updated last year
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆51Updated last year
- Demo for GitHub Universe 2022☆12Updated 2 years ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3☆23Updated 9 months ago
- ☆11Updated 7 months ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- ☆13Updated last week
- Showcases the AsyncIO Functionality within Apache Flink for Kinesis Data Analytics☆10Updated 5 months ago
- ☆31Updated last year
- ☆14Updated 4 years ago
- Customizable GitOps template for Kubeflow on AWS EKS☆10Updated 4 years ago
- ☆16Updated 2 years ago
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate☆37Updated 2 months ago
- AWS Quick Start Team☆19Updated 8 months ago
- Describes the concepts of lambda architecture and the actual deployment process with an example of building a serverless business intelli…☆15Updated 2 weeks ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace☆21Updated 4 months ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated 2 years ago
- Samples and documentation for various advertising and marketing use cases on AWS.☆36Updated 2 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 5 months ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24Updated last year
- ☆15Updated 8 months ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- ☆10Updated 9 months ago