Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.
☆173Mar 6, 2021Updated 5 years ago
Alternatives and similar repositories for spark-dynamodb
Users that are interested in spark-dynamodb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DynamoDB data source for Apache Spark☆95Sep 2, 2021Updated 4 years ago
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB☆228Jan 15, 2026Updated 2 months ago
- Single node, in-memory DataFrame analytics library.☆43Mar 6, 2026Updated 2 weeks ago
- A Serverless function for posting to a Slack Webhook in response to a Mailgun route☆11Oct 12, 2016Updated 9 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Jan 10, 2019Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago
- ☆16Apr 25, 2019Updated 6 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 4 months ago
- A lightweight Scala DSL for system testing REST web services☆24Jun 19, 2014Updated 11 years ago
- Kinesis Connector for Structured Streaming☆139Jul 2, 2024Updated last year
- Project to concentrate files and settings for AWS EMR monitoring. Source: https://aws.amazon.com/blogs/big-data/monitor-and-optimize-anal…☆15Oct 11, 2024Updated last year
- Performant Redshift data source for Apache Spark☆140Mar 17, 2026Updated last week
- unix domain sockets that look just like tcp sockets☆11Jun 21, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Oct 28, 2019Updated 6 years ago
- ☆24Oct 3, 2023Updated 2 years ago
- This repo demonstrates how to use AWS application auto-scaling to implement custom-scaling in your Kinesis Data Analytics for Apache Flin…☆19Feb 21, 2025Updated last year
- Serverless sign-up to Slack (and other services) with Serverless.com☆30Nov 2, 2018Updated 7 years ago
- Spark data source for Salesforce☆80May 23, 2024Updated last year
- A simple Spark-powered ETL framework that just works 🍺☆184Oct 2, 2025Updated 5 months ago
- Github Actions support for building SBT projects☆14Feb 9, 2021Updated 5 years ago
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Oct 21, 2019Updated 6 years ago
- Utility generating avro files from postgres☆17Jul 9, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18Nov 4, 2024Updated last year
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆77Oct 30, 2018Updated 7 years ago
- ☆12Feb 25, 2026Updated last month
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Feb 13, 2020Updated 6 years ago
- One enum type class to rule them all☆30Mar 17, 2026Updated last week
- an impala client for ruby☆34Jan 25, 2017Updated 9 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,596Updated this week
- Reference Architectures for Datalakes on AWS☆78May 13, 2020Updated 5 years ago
- Image Occlusion addon for Anki☆31Dec 5, 2015Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ideas and demonstrations of named tuples to the max☆29Apr 10, 2025Updated 11 months ago
- Project to build WebLogic Domains with Oracle Fusion Middleware 12c components using scripts.☆12Jul 13, 2018Updated 7 years ago
- Pipeline to build, test and deploy Serverless Framework Projects with CodeBuild and CodePipeline on AWS using Terraform.☆43Mar 12, 2019Updated 7 years ago
- Explore the use of different patterns to produce clean code☆21Oct 11, 2014Updated 11 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆70Feb 21, 2024Updated 2 years ago
- Eval ruby code on filtering☆13Dec 15, 2015Updated 10 years ago
- Capistrano with rsync to deployment hosts from local repository.☆19May 21, 2019Updated 6 years ago