spark-redshift-community/spark-redshift

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/spark-redshift-community/spark-redshift)

spark-redshift-community / spark-redshift

Performant Redshift data source for Apache Spark

☆140

Alternatives and similar repositories for spark-redshift

Users that are interested in spark-redshift are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

databricks / spark-redshift
View on GitHub
Redshift data source for Apache Spark
☆608Aug 10, 2023Updated 2 years ago
aws-samples / emr-spark-benchmark
View on GitHub
☆26Apr 26, 2026Updated 2 months ago
buildkite / python-pipenv-example
View on GitHub
An example pipeline that tests a Python project using pipenv for dependency management.
☆16Apr 14, 2026Updated 3 months ago
aws-samples / spark-streaming-sql-s3-connector
View on GitHub
An Apache Spark Structured Streaming S3 connector for reading S3 files using Amazon S3 event notifications to AWS SQS
☆16Feb 13, 2024Updated 2 years ago
aws-samples / amazon-kinesis-data-analytics-flink-benchmarking-utility
View on GitHub
Amazon Managed Service for Apache Flink Benchmarking Utility helps with capacity planning, integration testing, and benchmarking of Amazo…
☆21Aug 30, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
acrlabs / kube-scheduler-rs-reference
View on GitHub
A reference implementation of a Kubernetes scheduler written in Rust
☆12Mar 4, 2024Updated 2 years ago
broxtronix / spark-gce
View on GitHub
A tool for running Spark on Google Compute Engine
☆16Jan 20, 2017Updated 9 years ago
audienceproject / spark-dynamodb
View on GitHub
Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.
☆174Mar 6, 2021Updated 5 years ago
aws / aws-emr-best-practices
View on GitHub
A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…
☆110Apr 5, 2026Updated 3 months ago
awslabs / amazon-redshift-utils
View on GitHub
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
☆2,812Sep 3, 2025Updated 10 months ago
qubole / s3-sqs-connector
View on GitHub
A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).
☆19Apr 20, 2024Updated 2 years ago
aws-samples / amazon-kinesis-analytics-streaming-etl
View on GitHub
Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics
☆65Oct 17, 2023Updated 2 years ago
aws-samples / amazon-redshift-config-compare
View on GitHub
☆25Oct 12, 2023Updated 2 years ago
HurSungYun / kafbat-ui-serde-protobuf-descriptor
View on GitHub
A protobuf serde for Kafbat UI using a protobuf descriptor file
☆17Dec 4, 2025Updated 7 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
awslabs / aws-glue-data-catalog-client-for-apache-hive-metastore
View on GitHub
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…
☆230May 18, 2026Updated last month
colbyford / sparkitecture
View on GitHub
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
☆13Oct 27, 2021Updated 4 years ago
sqlalchemy-redshift / sqlalchemy-redshift
View on GitHub
Amazon Redshift SQLAlchemy Dialect
☆228Apr 28, 2026Updated 2 months ago
aws-samples / emr-studio-samples
View on GitHub
This repo contains samples for EMR Studio feature.
☆21Nov 15, 2022Updated 3 years ago
rssanders3 / airflow-spark-operator-plugin
View on GitHub
A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator
☆73Sep 20, 2019Updated 6 years ago
opencredo / mesos_service_discovery
View on GitHub
Service Discovery script for Mesos and Marathon
☆15Oct 9, 2014Updated 11 years ago
aws-samples / aws-emr-apache-ranger
View on GitHub
☆24Oct 3, 2023Updated 2 years ago
awsdocs / amazon-redshift-developer-guide
View on GitHub
This is the documentation for the Amazon Redshift Developer Guide
☆120Jun 15, 2023Updated 3 years ago
astronomer / dynamic-dags-tutorial
View on GitHub
☆31Jul 7, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
getsentry / sentry-spark
View on GitHub
Apache Spark Sentry Integration
☆16Aug 13, 2021Updated 4 years ago
aws / aws-kinesisanalytics-runtime
View on GitHub
This library contains the Kinesis Analytics stream processing runtime configuration classes.
☆11Jan 26, 2026Updated 5 months ago
YotpoLtd / metorikku
View on GitHub
A simplified, lightweight ETL Framework based on Apache Spark
☆588Jan 24, 2024Updated 2 years ago
aws-samples / aws-concurrent-data-orchestration-pipeline-emr-livy
View on GitHub
This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…
☆76Oct 30, 2018Updated 7 years ago
stealthly / punxsutawney
View on GitHub
An Apache Mesos Framework that allows for replaying load over and over and over (and over) again
☆10Aug 10, 2015Updated 10 years ago
microsoft / install-databricks-cli
View on GitHub
GitHub Action that installs Databricks CLI
☆14Sep 22, 2021Updated 4 years ago
getlantern / notifier
View on GitHub
A library for sending native desktop notifications from Go
☆14Aug 30, 2024Updated last year
brndnmtthws / kafka-on-marathon
View on GitHub
Scripts for running Apache Kafka on Mesosphere's Marathon
☆14Dec 6, 2015Updated 10 years ago
fleetio / dbt-segment
View on GitHub
Data models for Segment built using dbt (getdbt.com).
☆12Jul 31, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
astronomer / ap-airflow
View on GitHub
Astronomer Core Docker Images
☆105May 22, 2024Updated 2 years ago
aws-samples / aws-glue-flatten-nested-json
View on GitHub
☆51Aug 9, 2022Updated 3 years ago
lightbend / flink-operator
View on GitHub
Helm Chart for lyft/flinkk8soperator
☆11Mar 10, 2020Updated 6 years ago
jbarrasa / openpermid2neo4j
View on GitHub
importing Thomson Reuters' permID dataset into Neo4j
☆19Feb 1, 2018Updated 8 years ago
PBWebMedia / airflow-prometheus-exporter
View on GitHub
Export Airflow metrics (from mysql) in prometheus format
☆29Jun 12, 2026Updated last month
aws / amazon-redshift-jdbc-driver
View on GitHub
Redshift JDBC Driver. It supports JDBC 4.2 specification.
☆71Jul 7, 2026Updated last week
mkrn / appsync-cloudformation-quick-start
View on GitHub
AWS AppSync CloudFormation Quick Start Template
☆12Dec 19, 2018Updated 7 years ago