hortonworks-spark/spark-schema-registry

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hortonworks-spark/spark-schema-registry)

hortonworks-spark / spark-schema-registry

Schema Registry integration for Apache Spark

☆40

Alternatives and similar repositories for spark-schema-registry

Users that are interested in spark-schema-registry are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

implydata / druid-hadoop-inputformat
View on GitHub
Hadoop InputFormat for http://druid.io/
☆10Oct 26, 2016Updated 9 years ago
chermenin / spark-states
View on GitHub
Custom state store providers for Apache Spark
☆92Feb 14, 2025Updated last year
hurtn / databricks
View on GitHub
☆12Aug 6, 2020Updated 5 years ago
BenFradet / spark-kafka-writer
View on GitHub
Write your Spark data to Kafka seamlessly
☆172Jul 10, 2024Updated 2 years ago
zheyuan28 / SparkTaskMetrics
View on GitHub
Task Metrics Explorer
☆14Apr 2, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CodeRayZhang / Spark-Example
View on GitHub
Spark1.6和spark2.2的示例，包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
☆15Jan 28, 2018Updated 8 years ago
hazelcast / big-data-benchmark
View on GitHub
☆14Jun 30, 2026Updated 3 weeks ago
mkuthan / example-spark-kafka
View on GitHub
Apache Spark and Apache Kafka integration example
☆122Dec 21, 2017Updated 8 years ago
polomarcus / Spark-Structured-Streaming-Examples
View on GitHub
Spark Structured Streaming / Kafka / Cassandra / Elastic
☆186Feb 7, 2023Updated 3 years ago
databricks / spark-pr-dashboard
View on GitHub
Dashboard to aid in Spark pull request reviews
☆55Mar 30, 2023Updated 3 years ago
arskov / multipart-x-mixed-replace-java-player
View on GitHub
Motion JPEG (multipart/x-mixed-replace) stream player in Java
☆11Oct 23, 2020Updated 5 years ago
siddhi-io / siddhi-io-kafka
View on GitHub
Extension that can be used to receive events from a Kafka cluster and to publish events to a Kafka cluster
☆18May 13, 2026Updated 2 months ago
hammerlab / spark-util
View on GitHub
low-level helpers for Apache Spark libraries and tests
☆16Dec 29, 2018Updated 7 years ago
RADAR-base / MongoDb-Sink-Connector
View on GitHub
Kafka MongoDb sink connector
☆19Jul 23, 2019Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hammerlab / spark-json-relay
View on GitHub
SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.
☆16Apr 6, 2021Updated 5 years ago
maropu / spark-tpcds-datagen
View on GitHub
All the things about TPC-DS in Apache Spark
☆111Jun 15, 2023Updated 3 years ago
ansrivas / spark-structured-streaming
View on GitHub
Spark structured streaming with Kafka data source and writing to Cassandra
☆62Dec 5, 2019Updated 6 years ago
metamx / druid-spark-batch
View on GitHub
Druid indexing plugin for using Spark in batch jobs
☆102Oct 21, 2021Updated 4 years ago
apache / bahir
View on GitHub
Mirror of Apache Bahir
☆336Jul 7, 2023Updated 3 years ago
yahoo / maha
View on GitHub
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
☆132Jan 17, 2025Updated last year
sparsecode / DaFlow
View on GitHub
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…
☆26Jun 7, 2021Updated 5 years ago
bigbug / vscode-language-jsonata
View on GitHub
☆13Jan 6, 2024Updated 2 years ago
microsoft / Attestation-Client-Samples
View on GitHub
☆16Nov 18, 2025Updated 8 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hortonworks-spark / cloud-integration
View on GitHub
Spark cloud integration: tests, cloud committers and more
☆20Jan 30, 2025Updated last year
apache / arrow-site
View on GitHub
Apache Arrow Website
☆40Updated this week
amplab / drizzle-spark
View on GitHub
Drizzle integration with Apache Spark
☆120Sep 11, 2018Updated 7 years ago
mattroberts297 / slf4s
View on GitHub
A Simple Logging Facade for Scala
☆15Jun 17, 2019Updated 7 years ago
criteo / cluster-pack
View on GitHub
A library on top of either pex or conda-pack to make your Python code easily available on a cluster
☆47Feb 4, 2026Updated 5 months ago
liquidm / druid-dumbo
View on GitHub
☆21Mar 17, 2023Updated 3 years ago
rxin / TPC-H-Hive
View on GitHub
Running TPC-H on Apache Hive
☆41Jul 15, 2019Updated 7 years ago
milinda / samza-sql
View on GitHub
SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka
☆30Jun 8, 2016Updated 10 years ago
zero323 / pyspark-stubs
View on GitHub
Apache (Py)Spark type annotations (stub files).
☆118Aug 17, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
harupy / sphinx-plotly-directive
View on GitHub
A directive for including a Plotly figure in a Sphinx document.
☆18Oct 25, 2022Updated 3 years ago
joshlemer / MultiIndex
View on GitHub
A Scala Collection for Multiple Access Patterns
☆12Oct 22, 2016Updated 9 years ago
JohnReedLOL / pos
View on GitHub
Macro based print debugging for Scala code. Locates debug statements in your IDE. Supports logging.
☆23Oct 27, 2020Updated 5 years ago
leventerguder / injavawetrust.coffeeshop
View on GitHub
injavawetrust.coffeeshop
☆11Sep 9, 2021Updated 4 years ago
jpzk / kafcache
View on GitHub
Kafka Streams + Memcached (e.g. AWS ElasticCache) for low-latency in-memory lookups
☆13Nov 4, 2019Updated 6 years ago
elastacloud / spark-excel
View on GitHub
A Spark data source for reading Microsoft Excel files
☆13Jul 1, 2024Updated 2 years ago
apache / spark-connect-swift
View on GitHub
Apache Spark Connect Client for Swift
☆31Updated this week