anna-anisienia / data-discovery-api
☆14Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for data-discovery-api
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆19Updated 2 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Updated 3 months ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆21Updated last year
- Dask on ECS Fargate☆14Updated 5 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace☆21Updated last week
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆48Updated last year
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆12Updated 6 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆10Updated this week
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆25Updated last year
- Serverless Datalake architecture☆12Updated last year
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 3 years ago
- A project template for developing BYOD docker images for use in Amazon SageMaker.☆19Updated 4 years ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 5 years ago
- The objective of Cloud Builders' Day repository is to provide do-it-yourself lab guides for several AWS services including but not limite…☆11Updated 4 years ago
- ☆19Updated 4 years ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24Updated 6 months ago
- ☆28Updated 8 months ago
- ☆31Updated 8 months ago
- Using the Parquet file format with Python☆14Updated last year
- ☆11Updated last month
- Template for a modular, Python-based data science project.☆34Updated 7 months ago
- aws-solutions-library-samples / guidance-for-text-generation-using-embeddings-from-enterprise-data-on-awsThis Guidance demonstrates question answering using Retrieval Augmented Generation (RAG) with foundation models in Amazon SageMaker JumpS…☆9Updated last month
- Showcases the AsyncIO Functionality within Apache Flink for Kinesis Data Analytics☆10Updated last year
- ☆34Updated last year
- This solution combines Amazon Pinpoint with Amazon SageMaker to help automate the process of collecting customer data, predicting custom…☆17Updated 3 years ago
- ☆15Updated last year
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Updated 3 years ago
- Fully unit tested utility functions for data engineering. Python 3 only.☆14Updated 3 months ago
- dbt / Amazon Redshift Demonstration Project☆33Updated last year