alibaba/feathub

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibaba/feathub)

alibaba / feathub

FeatHub - A stream-batch unified feature store for real-time machine learning

☆349

Alternatives and similar repositories for feathub

Users that are interested in feathub are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

flink-extended / feathub-examples
View on GitHub
This project provides example FeatHub (https://github.com/alibaba/feathub) programs
☆28Sep 21, 2023Updated 2 years ago
flink-extended / flink-remote-shuffle
View on GitHub
Remote Shuffle Service for Flink
☆189Jan 6, 2023Updated 3 years ago
flink-extended / dl-on-flink
View on GitHub
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le…
☆693Nov 12, 2024Updated last year
apache / flink-ml
View on GitHub
Machine learning library of Apache Flink
☆333May 26, 2026Updated 2 months ago
flink-extended / clink
View on GitHub
Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators th…
☆30Feb 21, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
4paradigm / FeatInsight
View on GitHub
FeatInsight is a feature platform based on OpenMLDB
☆22Mar 7, 2025Updated last year
flink-extended / ai-flow
View on GitHub
AI Flow is an open source framework that bridges big data and artificial intelligence.
☆182Oct 9, 2022Updated 3 years ago
apache / paimon
View on GitHub
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …
☆3,349Updated this week
feathr-ai / feathr
View on GitHub
Feathr – A scalable, unified data and AI engineering platform for enterprise
☆1,937Apr 4, 2024Updated 2 years ago
apache / flink-benchmarks
View on GitHub
Benchmarks for Apache Flink
☆190May 28, 2026Updated last month
logicalclocks / hopsworks
View on GitHub
Hopsworks - Data-Intensive AI platform with a Feature Store
☆1,301Feb 10, 2025Updated last year
alibaba / pemja
View on GitHub
☆117Apr 23, 2026Updated 3 months ago
feast-dev / feast
View on GitHub
The Open Source Feature Store for AI/ML
☆7,171Updated this week
4paradigm / OpenMLDB
View on GitHub
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and infe…
☆1,698Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alibaba / Alink
View on GitHub
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
☆3,611Jun 7, 2024Updated 2 years ago
Ackuq / spark-pit
View on GitHub
Point-in-Time optimizations for Apache Spark
☆30Jan 18, 2024Updated 2 years ago
DeepRec-AI / DeepRec
View on GitHub
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…
☆1,197Jan 21, 2025Updated last year
alibaba / EasyRec
View on GitHub
A framework for large scale recommendation algorithms.
☆2,350Apr 15, 2026Updated 3 months ago
DataLinkDC / dinky
View on GitHub
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
☆3,744Updated this week
ververica / flink-sql-cookbook
View on GitHub
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are c…
☆915Jan 12, 2026Updated 6 months ago
apache / celeborn
View on GitHub
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆1,059Updated this week
nexmark / nexmark
View on GitHub
Benchmarks for queries over continuous data streams.
☆387Dec 26, 2025Updated 6 months ago
ververica / flink-sql-gateway
View on GitHub
☆489Oct 21, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apache / streampark
View on GitHub
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
☆4,324Updated this week
4paradigm / spark
View on GitHub
This is OpenMLDB's Spark Distribution, which is particularly optimized for feature extraction. It includes a few novel techniques, such a…
☆12Jul 30, 2024Updated last year
apache / flink-kubernetes-operator
View on GitHub
Apache Flink Kubernetes Operator
☆1,021Updated this week
apache / flink-statefun
View on GitHub
Apache Flink Stateful Functions
☆535May 15, 2026Updated 2 months ago
apache / amoro
View on GitHub
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
☆1,152Updated this week
decis-bench / febench
View on GitHub
A Benchmark for Real-Time Relational Data Feature Extraction (VLDB'23 Best Industry Paper Runnerup)
☆54Sep 9, 2023Updated 2 years ago
apache / fluss
View on GitHub
Apache Fluss is a streaming storage built for real-time analytics.
☆2,005Updated this week
ververica / frocksdb
View on GitHub
☆70Aug 21, 2024Updated last year
bytedance / bitsail
View on GitHub
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is w…
☆1,676Jan 1, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
RealtimeCompute / ververica-connector-demo
View on GitHub
Demos for Flink connectors on Ververica Platform (VVP)
☆44Jun 25, 2025Updated last year
apache / flink-connector-shared-utils
View on GitHub
Apache flink
☆16May 15, 2026Updated 2 months ago
dianfu / pyflink-faq
View on GitHub
Frequently Asked Questions about PyFlink
☆23Mar 1, 2023Updated 3 years ago
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,577Updated this week
apache / seatunnel
View on GitHub
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
☆9,503Updated this week
apache / flink-cdc
View on GitHub
Flink CDC is a streaming data integration tool
☆6,451Updated this week