shirukai/spark-structured-datasource

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shirukai/spark-structured-datasource)

shirukai / spark-structured-datasource

Custom datasource about spark structure streaming

☆12

Alternatives and similar repositories for spark-structured-datasource

Users that are interested in spark-structured-datasource are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tspannhw / nifi-convertjsontoddl-processor
View on GitHub
Apache NiFi 1.5/1.6/1.9.2+ Processor to produce DDL
☆11Nov 16, 2022Updated 3 years ago
bluejoe2008 / spark-http-stream
View on GitHub
spark structured streaming via HTTP communication
☆18Jul 7, 2022Updated 4 years ago
gitter-badger / waterdrop
View on GitHub
An easy-to-use, scalable spark streaming ETL tool and sdk
☆14Aug 14, 2017Updated 8 years ago
ilovethepc / CDH_offline_install
View on GitHub
CDH6.3.2离线安装
☆11Nov 2, 2020Updated 5 years ago
piotr-kalanski / data-model-generator
View on GitHub
Data model generator based on Scala case classes
☆29Nov 5, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TBDSUDC / tbds_maintance_dev_demo_project
View on GitHub
☆15Apr 12, 2022Updated 4 years ago
japila-books / spark-structured-streaming-internals
View on GitHub
The Internals of Spark Structured Streaming
☆420Mar 3, 2026Updated 4 months ago
CyrusZhou-CN / openGauss_master_slave
View on GitHub
openGauss 数据库 docker compose ，patroni 自动主备切换，HAProxy 数据库读写负载均衡，测试环境
☆11May 16, 2022Updated 4 years ago
wankunde / sql-runner
View on GitHub
☆17Mar 19, 2024Updated 2 years ago
CodeRayZhang / Spark-Example
View on GitHub
Spark1.6和spark2.2的示例，包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
☆15Jan 28, 2018Updated 8 years ago
junk16 / spark-yarn-cluster
View on GitHub
Apache Spark on Apache Yarn 2.6.0 cluster Docker image
☆12Oct 18, 2017Updated 8 years ago
wojiushixiaobai / redis-sentinel
View on GitHub
redis-sentinel
☆20Aug 7, 2021Updated 4 years ago
jerryshao / spark-hive-streaming-sink
View on GitHub
A sink to save Spark Structured Streaming DataFrame into Hive table
☆29Apr 16, 2018Updated 8 years ago
spirom / spark-data-sources
View on GitHub
Developing Spark External Data Sources using the V2 API
☆49Apr 29, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shrekwang / object-inspector
View on GitHub
用来检测java对象占用内存情况的小工具
☆16Mar 1, 2013Updated 13 years ago
sderosiaux / parquet-custom-reader-writer
View on GitHub
Simple implementation of a custom parquet reader/writer
☆11Aug 12, 2016Updated 9 years ago
ilyaglow / dor
View on GitHub
🌦️ Domain Ranker
☆16Sep 7, 2019Updated 6 years ago
wfxiang08 / rpc_proxy
View on GitHub
基于thrift的服务注册和发现框架
☆13Oct 9, 2017Updated 8 years ago
alexdebrie / aws-api-performance-bakeoff
View on GitHub
Code and architecture diagrams for performance testing a few API approaches on AWS
☆10Apr 20, 2019Updated 7 years ago
frankyu8 / ushas
View on GitHub
This project is used for tracking lineage when using spark. Our team is aimed at enhancing the ability of column relation during logical …
☆20Jan 7, 2022Updated 4 years ago
sev7e0 / wow-spark
View on GitHub
spark自学手册，包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake，以及scala基础练习，还有一些例如master、shuﬄe源码分析，总结及翻译。
☆18Jul 19, 2023Updated 3 years ago
mshtelma / spark-structured-streaming-jdbc-sink
View on GitHub
Spark Structured Streaming JDBC Sink
☆16Apr 26, 2021Updated 5 years ago
brianm / jp
View on GitHub
Like jq, but with json pointers
☆16Nov 30, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
philwinder / mesos-terraform
View on GitHub
Create an Amazon AWS Mesos cluster using Terraform
☆12Feb 15, 2017Updated 9 years ago
gdtm86 / spark-streaming-kafka-cdh511-testing
View on GitHub
Spark Streaming,Kafka and HBase code accompanying the blog 'Offset Management For Apache Kafka With Apache Spark Streaming'
☆23Jun 26, 2017Updated 9 years ago
liekkassmile / flink-connector-clickhouse-1.13
View on GitHub
flink-connector-clickhouse-1.13
☆33Apr 15, 2026Updated 3 months ago
zhang-xzhi / encodingchecker
View on GitHub
file encoding checker
☆17Feb 27, 2022Updated 4 years ago
kzwang / elasticsearch-osem
View on GitHub
Object Search Engine Mapping for ElasticSearch
☆17Feb 13, 2014Updated 12 years ago
redpanda-data / openmessaging-benchmark
View on GitHub
☆40Jul 1, 2026Updated 3 weeks ago
cloudera / kafka-examples
View on GitHub
Kafka Examples repository.
☆44Feb 5, 2019Updated 7 years ago
bill-cc / metadata-hive-hook
View on GitHub
Hive hook, obtain task information from Hive, fetch input/output tables and lineage information from HSQL.
☆40Jul 23, 2023Updated 3 years ago
ZPGuiGroupWhu / Spark-based-DBSCAN-Algorithms
View on GitHub
A parallel algorithm package for DBSCAN based on Apache Spark, including KDBSCAN, KDSG and other optimized DBSCAN algorithms. This framew…
☆16Jun 17, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
colorknight / moql-transx
View on GitHub
Translate moql syntax to syntax of X database
☆24Dec 3, 2025Updated 7 months ago
s11e-dao / bsin-paas-all-in-one
View on GitHub
☆25Oct 16, 2022Updated 3 years ago
lookout / cassandra-statsd-agent
View on GitHub
Java Agent for Cassandra integration with StatsD
☆13Sep 24, 2015Updated 10 years ago
GatsbyNewton / hive-udf
View on GitHub
UDF, GenericUDF, UDTF, UDAF
☆11Jul 1, 2022Updated 4 years ago
XDgov / weehive
View on GitHub
A minimal Apache Hive server in a Docker image
☆13Dec 24, 2020Updated 5 years ago
glink-incubator / glink
View on GitHub
A Spatial Extension of Apache Flink
☆26Aug 27, 2024Updated last year
flaxsearch / lucene-solr-intervals
View on GitHub
Flax-maintained fork of Lucene/Solr with support for interval queries
☆15Oct 9, 2015Updated 10 years ago