Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data Platform implementations on the AWS Cloud.
☆24 Sep 6, 2023 Updated 2 years ago
Alternatives and similar repositories for aws-glue-apache-hudi-operational-data-processing-framework
Users that are interested in aws-glue-apache-hudi-operational-data-processing-framework are comparing it to the libraries listed below.
- Replication utility for AWS Glue Data Catalog ☆79 Aug 8, 2024 Updated last year
- ☆25 Jul 4, 2023 Updated 2 years ago
- ☆16 Jun 14, 2023 Updated 2 years ago
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines. ☆69 Aug 9, 2022 Updated 3 years ago
- AppSync Events frontend sample implementation ☆12 Nov 16, 2024 Updated last year
- ☆14 May 2, 2024 Updated last year
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. ☆101 Aug 12, 2022 Updated 3 years ago
- ☆15 Aug 12, 2025 Updated 8 months ago
- Hudi Demo Notebook ☆11 Mar 5, 2024 Updated 2 years ago
- ☆11 Jun 12, 2023 Updated 2 years ago
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO) ☆39 Updated this week
- ☆17 Mar 26, 2017 Updated 9 years ago
- ☆72 Jun 3, 2024 Updated last year
- Open source tutorial-based Machine Learning framework ☆16 Dec 8, 2022 Updated 3 years ago
- Provides Pydantic pre-built model for all boto3 stubs & validators + generators for AWS resources. ☆12 Sep 4, 2025 Updated 7 months ago
- AWS Lambda FastAPI with Serverless ☆17 Dec 1, 2024 Updated last year
- Newsletter Backend Orchestration Template in AWS with SST ☆15 Jun 20, 2024 Updated last year
- Spark app to merge different schemas ☆23 Dec 21, 2020 Updated 5 years ago
- How to have private platform serverless APIs communicating securely internally within your organisations without needing to traverse the … ☆11 Dec 15, 2021 Updated 4 years ago
- Frontend app to go with the backend Cognito demos ☆14 Mar 19, 2023 Updated 3 years ago
- Example of using Amazon EventBridge OpenAPI schemas with TypeScript and the AWS CDK ☆14 Oct 10, 2022 Updated 3 years ago
- ☆16 Sep 18, 2023 Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆53 Oct 31, 2023 Updated 2 years ago
- Autobot is your nice and friendly bot. He is pluggable via Pub/Sub and written in @golang. ☆13 Mar 7, 2023 Updated 3 years ago
- Interactive Elasticsearch Analyzer ☆13 Dec 8, 2022 Updated 3 years ago
- An example of event sourcing and CQRS in serverless, with code examples in TypeScript and the AWS CDK. ☆11 Apr 24, 2024 Updated last year
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆91 Dec 29, 2022 Updated 3 years ago
- Solution to the first task of the Tinkoff Data Science Challenge competition. I'll post only the best GBM model from the ensemble; it gives CV .7780, public … ☆22 Apr 26, 2017 Updated 8 years ago
- Meetup Organisation ☆10 Oct 12, 2018 Updated 7 years ago
- Enterprise-grade, production-hardened, serverless data lake on AWS ☆479 Oct 1, 2025 Updated 6 months ago
- A tutorial for using Hadoop with Python and Hive ☆10 May 26, 2015 Updated 10 years ago
- Monitoring and insights on your data lakehouse tables ☆32 Apr 3, 2026 Updated last week
- A script that gets data from the Twitter real-time API, passes it to a message-queue (e.g. RabbitMQ) and stores tweets into MongoDB ☆11 Apr 20, 2017 Updated 8 years ago
- Demo to try out gRPC with NodeJS gRPC client and Golang gRPC server ☆14 Sep 1, 2021 Updated 4 years ago
- An opinionated discussion around how to set up, structure, and deploy your AWS CDK Serverless apps using CDK Pipelines in line with AWS b… ☆13 Mar 4, 2023 Updated 3 years ago
- ☆45 Apr 4, 2026 Updated last week
- Using TypeScript and the AWS CDK, you can integrate Knowledge Bases into Amazon Bedrock to provide foundation models with contextual data… ☆14 May 9, 2024 Updated last year
- CDK Construct for creating a bastion host to forward a connection to several AWS data services inside a private subnet from your local ma… ☆31 Updated this week
- Salary report visualization with D3.js ☆30 Dec 6, 2021 Updated 4 years ago