airbnb/sputnik

☆64

Alternatives and similar repositories for sputnik

Users that are interested in sputnik are comparing it to the libraries listed below.

10xfuturetechnologies / kafka-connect-iceberg
Kafka Connector for Iceberg tables
☆16 · Jul 24, 2023 · Updated 3 years ago
wooplevip / sedis
SQL for Redis
☆11 · Sep 16, 2022 · Updated 3 years ago
CallHandling / freeswitch-scala-esl
A reactive event socket library for FreeSwitch written using Scala and Akka Streams
☆13 · Jul 9, 2024 · Updated 2 years ago
dstreev / hdp-data-gen
Hortonworks Data Platform Data Generation Tool
☆13 · Nov 30, 2017 · Updated 8 years ago
zrlio / parquet-generator
Parquet file generator
☆22 · Apr 17, 2018 · Updated 8 years ago
rwhitten577 / faust-example
Example of using Faust with Docker
☆23 · Sep 30, 2019 · Updated 6 years ago
AbsaOSS / hyperdrive
Extensible streaming ingestion pipeline on top of Apache Spark
☆47 · Jul 17, 2025 · Updated last year
Wheest / pytorch-lightning-cifar
Common CNN models defined for PyTorch Lightning
☆10 · Jul 28, 2022 · Updated 3 years ago
ScalaConsultants / akka-periscope
Akka plugin to collect various data about actors
☆17 · Aug 19, 2024 · Updated last year
kolotaev / ride
Scala GUID generator for large systems
☆16 · Jul 6, 2023 · Updated 3 years ago
datasphere-oss / datasphere-service
An open source dataworks platform
☆20 · Jun 4, 2021 · Updated 5 years ago
qubole / spark-state-store
RocksDB state storage implementation for Structured Streaming.
☆17 · Oct 21, 2020 · Updated 5 years ago
jdye64 / docker-hwx
Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components
☆10 · Oct 11, 2019 · Updated 6 years ago
yamrcraft / etl-light
A light Kafka to HDFS/S3 ETL library based on Apache Spark
☆40 · Jun 29, 2017 · Updated 9 years ago
avast / rabbitmq-scala-client
Scala wrapper over standard RabbitMQ Java client library
☆37 · Updated this week
swoop-inc / spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
☆73 · Mar 14, 2021 · Updated 5 years ago
AbsaOSS / atum
A dynamic data completeness and accuracy library at enterprise scale for Apache Spark
☆30 · May 13, 2026 · Updated 2 months ago
speedb-io / log-parser
A tool for analyzing and parsing SpeedB and RocksDB log files
☆22 · Mar 31, 2024 · Updated 2 years ago
hortonworks-spark / spark-hive-streaming-sink
A sink to save Spark Structured Streaming DataFrame into Hive table
☆23 · May 7, 2018 · Updated 8 years ago
knaufk / enrichments-with-flink
Code Samples for my Ververica Webinar "99 Ways to Enrich Streaming Data with Apache Flink"
☆41 · Jan 4, 2022 · Updated 4 years ago
qubole / streaminglens
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
☆17 · Jan 21, 2020 · Updated 6 years ago
uber / marmaray
Generic Data Ingestion & Dispersal Library for Hadoop
☆483 · Mar 19, 2023 · Updated 3 years ago
zeroc-ice / datastorm
Data centric pub/sub framework based on Ice
☆13 · Oct 15, 2024 · Updated last year
Nexmo / comms-router
A server which allows you to route tasks to agents.
☆21 · Jul 12, 2026 · Updated last week
Ctrip-DI / Hue-Ctrip-DI
Ctrip Data Infrastructure team works for hue
☆16 · Dec 10, 2014 · Updated 11 years ago
youngwookim / awesome-presto
A curated list of awesome PrestoDB / Trino software, libraries, tools and resources
☆18 · Jun 28, 2021 · Updated 5 years ago
qubole / spark-acid
ACID Data Source for Apache Spark based on Hive ACID
☆97 · Jul 7, 2021 · Updated 5 years ago
avensolutions / spark-sql-etl-framework
Multi-stage, config driven, SQL based ETL framework using PySpark
☆26 · Sep 16, 2019 · Updated 6 years ago
spilth / maven-book
A book about Maven in the style of the Pragmatic Guides published by The Pragmatic Bookshelf
☆11 · Dec 12, 2015 · Updated 10 years ago
xavient / CDS
Content Data Store (HDFS/HBase)
☆13 · Dec 1, 2016 · Updated 9 years ago
tugul / CoreJava
Java 8's core concepts are explained by examples.
☆12 · Oct 12, 2018 · Updated 7 years ago
rbrush / kite-apps
Prescriptive Applications over Kite and Hadoop
☆12 · Oct 14, 2015 · Updated 10 years ago
SETL-Framework / setl
A simple Spark-powered ETL framework that just works 🍺
☆186 · Oct 2, 2025 · Updated 9 months ago
jgperrin / net.jgp.labs.spark.datasources
Building custom data sources for Apache Spark, in Java.
☆11 · Oct 12, 2020 · Updated 5 years ago
alexholmes / hsync
HDFS rsync-like utility to replicate data between HDFS clusters
☆17 · Jun 16, 2012 · Updated 14 years ago
kokosing / trino-query-formatter
Presto SQL query formatter
☆15 · Jan 1, 2024 · Updated 2 years ago
swoop-inc / spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
☆191 · Oct 15, 2025 · Updated 9 months ago
AMEE / AMON
☆16 · Nov 27, 2012 · Updated 13 years ago
ExpediaGroup / datasqueeze
Hadoop utility to compact small files
☆18 · Feb 16, 2026 · Updated 5 months ago