alibaba / feathub
FeatHub - A stream-batch unified feature store for real-time machine learning
☆338Updated last year
Alternatives and similar repositories for feathub
Users who are interested in feathub compare it to the libraries listed below
- AI Flow is an open source framework that bridges big data and artificial intelligence.☆180Updated 2 years ago
- Machine learning library of Apache Flink☆322Updated 10 months ago
- This project provides example FeatHub (https://github.com/alibaba/feathub) programs☆30Updated last year
- Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le…☆694Updated 10 months ago
- Cloud Shuffle Service (CSS) is a general-purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆261Updated last year
- Remote Shuffle Service for Flink☆191Updated 2 years ago
- MaxCompute Spark demo for building a runnable application.☆115Updated 7 months ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Updated 2 years ago
- This GitHub project is the dedicated voting project for Flink Forward Asia Hackathon (2021).☆121Updated 3 years ago
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆427Updated this week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆346Updated 2 months ago
- ☆111Updated last month
- A collection of Apache Hudi-related resources☆559Updated 2 weeks ago
- Benchmarks for Apache Flink☆179Updated 2 months ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆980Updated this week
- Playground for Flink Table Store with use cases and performance features☆50Updated 2 years ago
- Spark ClickHouse Connector built on the DataSourceV2 API☆207Updated last week
- alibabacloud-jindodata☆198Updated 3 weeks ago
- ☆568Updated last year
- An experimental materialized view solution based on TiDB/TiKV and Flink with strong consistency support.☆64Updated 3 years ago
- Shuttle: Highly Available, High-Performance Remote Shuffle Service☆156Updated 2 years ago
- ☆66Updated 2 years ago
- A New Way of Data Lake☆48Updated 3 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆301Updated last year
- Benchmarks for queries over continuous data streams.☆358Updated 8 months ago
- It is a high-performance causal inference (statistical model) computing library based on OLAP, which solves the performance bottleneck of…☆168Updated last month
- Spark Connector for Apache Doris☆101Updated this week
- Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators th…☆29Updated 3 years ago
- TiDB connectors for Flink/Hive/Presto☆220Updated last year
- Some useful custom Hive UDFs, especially array, JSON, math, and string functions.☆227Updated last year