flink-extended / clinkLinks
Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators that can be used in both C++ and Java runtime.
☆29Updated 3 years ago
Alternatives and similar repositories for clink
Users that are interested in clink are comparing it to the libraries listed below
Sorting:
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 2 years ago
- Benchmarks for Apache Flink☆177Updated last month
- Remote Shuffle Service for Flink☆190Updated 2 years ago
- ☆65Updated 11 months ago
- ☆109Updated 2 weeks ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆259Updated last year
- Pafka is originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for …☆67Updated 3 years ago
- FeatHub - A stream-batch unified feature store for real-time machine learning☆336Updated last year
- Machine learning library of Apache Flink☆317Updated 9 months ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Updated 2 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆256Updated 2 years ago
- AI Flow is an open source framework that bridges big data and artificial intelligence.☆177Updated 2 years ago
- An experimental materialized view solution based on TiDB/TiKV and Flink with strong consistency support.☆64Updated 3 years ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated last year
- A Persistent Key-Value Store designed for Streaming processing☆102Updated 4 months ago
- The preview version of a spillable state backend for Apache Flink☆39Updated 4 years ago
- Dig Spark's source code.☆17Updated last year
- ☆48Updated 3 years ago
- TiDB connectors for Flink/Hive/Presto☆219Updated last year
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆27Updated this week
- Shared files, presentations, and other materials☆35Updated last week
- Port of TPC-DS dsdgen to Java☆21Updated 2 years ago
- alibabacloud-jindodata☆197Updated last week
- Serializable ACID transactions on streaming data☆25Updated 2 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 3 years ago
- Scalable NameNode RPC Proxy for HDFS Federation☆85Updated 9 years ago
- ☆105Updated 2 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆133Updated 2 years ago
- ☆86Updated last week
- Apache Calcite Adapter for Apache Kudu☆28Updated 10 months ago