Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR
☆17Apr 27, 2025Updated 11 months ago
Alternatives and similar repositories for amazon-emr-with-delta-lake
Users that are interested in amazon-emr-with-delta-lake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Amazon Redshift Serverless RSQL ETL Framework☆10Apr 1, 2025Updated last year
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 5 months ago
- Replication utility for AWS Glue Data Catalog☆79Aug 8, 2024Updated last year
- Example code for running Spark and Hive jobs on EMR Serverless.☆169Mar 11, 2026Updated last month
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24May 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A full-featured command line interface (CLI) for Open Distro.☆24Jan 11, 2022Updated 4 years ago
- ☆18Jun 16, 2024Updated last year
- Scalable analytics using Apache Druid on AWS is a solution offered by AWS that enables customers to quickly and efficiently deploy, opera…☆25Jul 30, 2025Updated 8 months ago
- Toy JVM is written in Rust☆13Jan 25, 2021Updated 5 years ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS☆36Feb 15, 2025Updated last year
- ☆33Mar 20, 2024Updated 2 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Jul 13, 2022Updated 3 years ago
- ☆18Dec 2, 2025Updated 4 months ago
- This guidance creates a scalable environment in AWS to prepare genomic, clinical, mutation, expression and imaging data for large-scale a…☆24Jun 30, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Python library & CLI to create, view and edit PFB files☆13Feb 19, 2026Updated last month
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆53Oct 31, 2023Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆32Apr 13, 2023Updated 3 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆33Mar 26, 2026Updated 3 weeks ago
- Repository for the paper "Discovering and Categorising Language Biases in Reddit" accepted at the International Conference on Web and Soc…☆12Aug 20, 2024Updated last year
- ☆12Jan 30, 2024Updated 2 years ago
- ☆14May 19, 2023Updated 2 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆41Feb 13, 2026Updated 2 months ago
- A Singer.io Target for Snowflake☆11Jun 9, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A simple python SDK around PubMed API.☆21Jan 1, 2025Updated last year
- This is a simple demo for integrating Authing in AWS China region to protect API Gateway REST API and other AWS resources such as IoT, Po…☆12Oct 12, 2024Updated last year
- Ascertained Sequentially Markovian Coalescent☆16Oct 22, 2025Updated 5 months ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆19Jun 27, 2025Updated 9 months ago
- Automated Machine Learning for Environmental Data-Driven Genome Prediction☆14Sep 12, 2025Updated 7 months ago
- A dotnet standard wrapper for the Uniswap V2 Subgraph on The Graph GraphQL API.☆12Dec 17, 2020Updated 5 years ago
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆27Apr 7, 2026Updated last week
- ☆44Aug 14, 2024Updated last year
- ☆12Feb 18, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Mar 4, 2025Updated last year
- Airstack Subgraphs☆11Jul 2, 2024Updated last year
- A Python client for managing connectors using the Kafka Connect API.☆12Oct 30, 2025Updated 5 months ago
- The Genomics Tertiary Analysis and Machine Learning Using Amazon SageMaker solution creates a scalable environment in AWS to develop mach…☆11Jul 7, 2023Updated 2 years ago
- Demo for scalable Elasticsearch setups with Frozen Indices, Index Lifecycle Management, and Rollups☆12Oct 17, 2020Updated 5 years ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated 10 months ago
- ☆16Feb 19, 2025Updated last year