anna-anisienia / data-discovery-api
☆14Updated 4 years ago
Alternatives and similar repositories for data-discovery-api:
Users who are interested in data-discovery-api are comparing it to the libraries listed below
- Using the Parquet file format with Python☆15Updated last year
- AWS Quick Start Team☆18Updated 6 months ago
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆19Updated 3 years ago
- Demo for GitHub Universe 2022☆12Updated 2 years ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆21Updated last year
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆18Updated 8 months ago
- Pandas helper functions☆30Updated 2 years ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- A tool to learn a JSON schema from a collection of documents and generate a CREATE TABLE statement for Redshift☆20Updated 5 months ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- This script gets a CSV file from Amazon S3 using the Python library Boto3 and converts it to Parquet format before uploading the new Parquet Ve…☆9Updated 4 years ago
- Samples and documentation for various advertising and marketing use cases on AWS.☆35Updated last year
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆50Updated last year
- Dask on ECS Fargate☆14Updated 5 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you…☆11Updated this week
- This solution combines Amazon Pinpoint with Amazon SageMaker to help automate the process of collecting customer data, predicting custom…☆17Updated 4 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- Streaming ETL job cases in AWS Glue to integrate Iceberg and create an in-place updatable data lake on Amazon S3☆23Updated 7 months ago
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆12Updated 6 years ago
- dbt (data build tool) projects targeting AWS analytics services (Redshift, Glue, EMR, Athena) and open table formats☆29Updated 2 years ago