allwefantasy/spark-binlog

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allwefantasy/spark-binlog)

allwefantasy / spark-binlog

A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).

☆152

Alternatives and similar repositories for spark-binlog

Users that are interested in spark-binlog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allwefantasy / delta-plus
View on GitHub
A library based on delta for Spark and MLSQL
☆60Dec 24, 2020Updated 5 years ago
allwefantasy / mlsql-plugins
View on GitHub
☆13Jun 17, 2022Updated 4 years ago
bebee4java / sqlalarm
View on GitHub
Big data smart alarm by sql
☆12May 11, 2021Updated 5 years ago
byzer-org / byzer-lang
View on GitHub
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
☆1,835May 29, 2024Updated 2 years ago
teeyog / blog
View on GitHub
My Blog
☆76May 3, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
teeyog / IQL
View on GitHub
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
☆377Dec 16, 2023Updated 2 years ago
haojinIntel / streaming_benchmark
View on GitHub
☆11Jun 30, 2026Updated 3 weeks ago
dxer / dataLink
View on GitHub
简单易用的ETL工具
☆17Mar 28, 2019Updated 7 years ago
passionke / starry
View on GitHub
fast spark local mode
☆35Aug 20, 2018Updated 7 years ago
AirToSupply / hudi-spark-plus
View on GitHub
A library based on Hudi for Spark.
☆10Nov 30, 2021Updated 4 years ago
aistack / sql-booster
View on GitHub
This is a library for SQL optimizing/rewriting including Materialized View rewrite
☆70Jun 21, 2022Updated 4 years ago
harbby / sylph
View on GitHub
Stream computing platform for bigdata
☆406Apr 24, 2024Updated 2 years ago
allwefantasy / mlsql-api-console
View on GitHub
☆22Jun 21, 2022Updated 4 years ago
edp963 / wormhole
View on GitHub
Wormhole is a SPaaS (Stream Processing as a Service) Platform
☆975Nov 16, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cjuexuan / mynote
View on GitHub
☆233Sep 15, 2022Updated 3 years ago
running-elephant / moonbox
View on GitHub
Moonbox is a DVtaaS (Data Virtualization as a Service) Platform
☆505Apr 14, 2023Updated 3 years ago
jerryshao / spark-hive-streaming-sink
View on GitHub
A sink to save Spark Structured Streaming DataFrame into Hive table
☆29Apr 16, 2018Updated 8 years ago
hortonworks-spark / shc
View on GitHub
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
☆546May 10, 2021Updated 5 years ago
allwefantasy / spark-hbase
View on GitHub
A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).
☆15Sep 29, 2023Updated 2 years ago
DTStack / flinkStreamSQL
View on GitHub
基于开源的flink，对其实时sql进行扩展；主要实现了流与维表的join，支持原生flink SQL所有的语法
☆2,052Feb 21, 2024Updated 2 years ago
Qihoo360 / XSQL
View on GitHub
Unified SQL Analytics Engine Based on SparkSQL
☆211Dec 5, 2022Updated 3 years ago
DTStack / chunjun
View on GitHub
A data integration framework
☆4,105Dec 2, 2025Updated 7 months ago
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lw-lin / CoolplaySpark
View on GitHub
酷玩 Spark: Spark 源代码解析、Spark 类库等
☆3,475May 18, 2022Updated 4 years ago
allwefantasy / pyjava
View on GitHub
This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…
☆49Jun 15, 2026Updated last month
jizhang / spark-sandbox
View on GitHub
A playground for Spark jobs.
☆43Dec 8, 2018Updated 7 years ago
alibaba / SparkCube
View on GitHub
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
☆136Mar 6, 2023Updated 3 years ago
WeBankFinTech / Scriptis
View on GitHub
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…
☆813Dec 11, 2024Updated last year
lizhitao0923 / ansible-hadoop
View on GitHub
Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash
☆10Mar 21, 2017Updated 9 years ago
neoremind / kraps-rpc
View on GitHub
A RPC framework leveraging Spark RPC module
☆207Mar 13, 2019Updated 7 years ago
bluejoe2008 / spark-http-stream
View on GitHub
spark structured streaming via HTTP communication
☆18Jul 7, 2022Updated 4 years ago
sq-q / hudi-spark-utilities-plus
View on GitHub
hudi-spark-utilities-plus
☆11Jul 29, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
linkedin / dr-elephant
View on GitHub
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370Aug 22, 2023Updated 2 years ago
allwefantasy / mlsql-web-console
View on GitHub
☆18Jun 16, 2021Updated 5 years ago
cas-packone / ambari-kylin-service
View on GitHub
☆30Dec 24, 2022Updated 3 years ago
Qihoo360 / Quicksql
View on GitHub
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
☆2,041Oct 25, 2022Updated 3 years ago
mbalassi / flink-parcel
View on GitHub
Flink parcel for Cloudera Manager
☆22Aug 1, 2019Updated 6 years ago
hairless / plink
View on GitHub
Platform for Flink
☆280Jan 3, 2023Updated 3 years ago
LinMingQiang / sparkstreaming
View on GitHub
封装sparkstreaming动态调节batch time(有数据就执行计算)；支持运行过程中增删topic；封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
☆181Apr 15, 2021Updated 5 years ago