pingcap / tisparkLinks
TiSpark is built for running Apache Spark on top of TiDB/TiKV
☆889Updated 5 months ago
Alternatives and similar repositories for tispark
Users that are interested in tispark are comparing it to the libraries listed below
Sorting:
- Placement driver for TiKV☆1,141Updated this week
- TiDB database documentation. TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time…☆617Updated this week
- Mirror of Apache Kudu☆1,894Updated last week
- This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)☆448Updated last week
- TiDB connectors for Flink/Hive/Presto☆217Updated last year
- Apache HAWQ☆695Updated last year
- The analytical engine for TiDB and TiDB Cloud. Try free: https://tidbcloud.com/free-trial☆1,001Updated this week
- tidb-tools are some useful tool collections for TiDB.☆320Updated 2 weeks ago
- TiDB operator creates and manages TiDB clusters running in Kubernetes.☆1,313Updated last week
- ☆328Updated 4 years ago
- Apache Impala☆1,256Updated this week
- A component manager for TiDB☆459Updated last week
- Apache Trafodion☆245Updated 4 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆752Updated 3 weeks ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆256Updated 2 years ago
- BaikalDB, A Distributed HTAP Database.☆1,228Updated 2 months ago
- High performance data store solution☆1,443Updated last month
- Data Migration Platform☆454Updated 3 years ago
- TBase is an enterprise-level distributed HTAP database. Through a single database cluster to provide users with highly consistent distrib…☆1,425Updated 5 months ago
- TiDB In Action: based on 4.0☆719Updated last year
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆142Updated 2 years ago
- A tool used to collect and merge tidb's binlog for real-time data backup and synchronization.☆295Updated 3 weeks ago
- Pravega - Streaming as a new software defined storage primitive☆2,006Updated 9 months ago
- An experimental materialized view solution based on TiDB/TiKV and Flink with strong consistency support.☆64Updated 4 years ago
- Streaming System 相关的论文读物☆736Updated 3 years ago
- Scalable NameNode RPC Proxy for HDFS Federation☆86Updated 9 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆261Updated last year
- Elastic data processing with Apache Pulsar and Apache Flink☆281Updated 3 years ago
- An open-source columnar data format designed for fast & realtime analytic with big data.☆452Updated 3 years ago
- Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.☆1,107Updated this week