Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.
☆173Mar 6, 2021Updated 5 years ago
Alternatives and similar repositories for spark-dynamodb
Users that are interested in spark-dynamodb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DynamoDB data source for Apache Spark☆95Sep 2, 2021Updated 4 years ago
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB☆228Apr 8, 2026Updated 3 weeks ago
- Single node, in-memory DataFrame analytics library.☆44Mar 6, 2026Updated last month
- A Serverless function for posting to a Slack Webhook in response to a Mailgun route☆11Oct 12, 2016Updated 9 years ago
- Serverless function to automate enforcement of Multi-Factor Authentication (MFA) to all AWS IAM users with access to AWS Management Conso…☆13Oct 30, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Jan 10, 2019Updated 7 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago
- ☆16Apr 25, 2019Updated 7 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 5 months ago
- Aa Rust-based command-line interface (CLI) and library for controlling Twinkly programmable LED controllers.☆15Feb 3, 2025Updated last year
- A lightweight Scala DSL for system testing REST web services☆24Jun 19, 2014Updated 11 years ago
- Kinesis Connector for Structured Streaming☆138Jul 2, 2024Updated last year
- Project to concentrate files and settings for AWS EMR monitoring. Source: https://aws.amazon.com/blogs/big-data/monitor-and-optimize-anal…☆15Oct 11, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Performant Redshift data source for Apache Spark☆140Mar 17, 2026Updated last month
- A library for Spark DataFrame using MinIO Select API☆102Sep 27, 2019Updated 6 years ago
- unix domain sockets that look just like tcp sockets☆11Jun 21, 2018Updated 7 years ago
- ☆11Oct 28, 2019Updated 6 years ago
- ☆24Oct 3, 2023Updated 2 years ago
- This repo demonstrates how to use AWS application auto-scaling to implement custom-scaling in your Kinesis Data Analytics for Apache Flin…☆19Feb 21, 2025Updated last year
- ☆18Nov 4, 2024Updated last year
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Oct 30, 2018Updated 7 years ago
- (Original: code.google.com/p/pylinda) A Linda implementation in Python.☆12Jun 16, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Jun 14, 2014Updated 11 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Feb 13, 2020Updated 6 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- One enum type class to rule them all☆30Apr 13, 2026Updated 3 weeks ago
- an impala client for ruby☆34Jan 25, 2017Updated 9 years ago
- ☆10Apr 8, 2020Updated 6 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,614Updated this week
- Reference Architectures for Datalakes on AWS☆78May 13, 2020Updated 5 years ago
- WebGL / Web Audio API interface for listening to the CIPIC HRTF database☆52Dec 29, 2014Updated 11 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Ideas and demonstrations of named tuples to the max☆29Apr 10, 2025Updated last year
- Explore the use of different patterns to produce clean code☆21Oct 11, 2014Updated 11 years ago
- An example of MLflow Tracking and Models Using Factorization Machine Recommender model library, rankfm.☆10Sep 9, 2021Updated 4 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆70Feb 21, 2024Updated 2 years ago
- Examples and custom spark images for working with the spark-on-k8s operator on AWS☆26Feb 14, 2021Updated 5 years ago
- Apache YuniKorn Scheduler Interface☆34Apr 8, 2026Updated 3 weeks ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆614Jun 5, 2023Updated 2 years ago