pingcap / tispark
TiSpark is built for running Apache Spark on top of TiDB/TiKV
☆887Updated 3 weeks ago
Alternatives and similar repositories for tispark
Users that are interested in tispark are comparing it to the libraries listed below
- TiDB database documentation. TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time…☆614Updated this week
- Placement driver for TiKV☆1,117Updated this week
- TiDB connectors for Flink/Hive/Presto☆219Updated last year
- Apache HAWQ☆695Updated last year
- tidb-tools is a collection of useful tools for TiDB.☆312Updated 3 weeks ago
- This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)☆440Updated this week
- Mirror of Apache Kudu☆1,883Updated this week
- The analytical engine for TiDB and TiDB Cloud. Try free: https://tidbcloud.com/free-trial☆988Updated this week
- Apache Trafodion☆246Updated 4 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆141Updated 2 years ago
- High performance data store solution☆1,434Updated last month
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆735Updated last week
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆256Updated 2 years ago
- TiDB operator creates and manages TiDB clusters running in Kubernetes.☆1,297Updated this week
- A component manager for TiDB☆447Updated this week
- Apache Impala☆1,228Updated this week
- ☆446Updated 2 years ago
- Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.☆953Updated this week
- Papers and reading materials related to streaming systems☆733Updated 3 years ago
- A tool used to collect and merge tidb's binlog for real-time data backup and synchronization.☆297Updated last week
- alibabacloud-jindodata☆197Updated last week
- Apache Phoenix☆1,045Updated this week
- An open-source columnar data format designed for fast & realtime analytic with big data.☆452Updated 2 years ago
- ☆327Updated 4 years ago
- Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads☆1,955Updated 3 weeks ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆259Updated last year
- Pravega - Streaming as a new software defined storage primitive☆2,006Updated 5 months ago
- Scalable NameNode RPC Proxy for HDFS Federation☆85Updated 9 years ago
- Data Migration Platform☆458Updated 3 years ago
- TiDB In Action: based on 4.0☆719Updated last year