☆220Jun 15, 2023Updated 3 years ago
Alternatives and similar repositories for primus
Users that are interested in primus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆162Apr 20, 2024Updated 2 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆260May 12, 2024Updated 2 years ago
- DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…☆1,192Jan 21, 2025Updated last year
- FeatHub - A stream-batch unified feature store for real-time machine learning☆349May 27, 2024Updated 2 years ago
- Multi-Cluster application progressive delivery controller☆21Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆271Mar 31, 2023Updated 3 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆532Mar 4, 2024Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆207May 22, 2026Updated last month
- An overview of Complex Event Processing Systems☆29Apr 6, 2022Updated 4 years ago
- A TensorFlow Extension: GPU performance tools for TensorFlow.☆26Jul 27, 2023Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- ☆220Aug 17, 2023Updated 2 years ago
- Themis MapReduce and TritonSort☆11Nov 2, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆929Dec 30, 2024Updated last year
- Tiray-egw(Tiray External Gateway) run on dpdk. EGW is used for Layer-4/7 load balancer and NATGW and VPC cloud network.☆11Feb 2, 2021Updated 5 years ago
- Bagua Speeds up PyTorch☆880Aug 1, 2024Updated last year
- transparently transmit context within or between goroutines☆29Dec 19, 2025Updated 6 months ago
- ☆13Aug 13, 2024Updated last year
- A Distributed Engine for AI☆49Jun 22, 2026Updated last week
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆97Apr 22, 2023Updated 3 years ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆414Jun 22, 2026Updated last week
- Kubernetes-native Deep Learning Framework☆744Jan 26, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)☆1,941Updated this week
- This is an incubating repository of the Apache SkyWalking AIOps Engine☆39Nov 1, 2023Updated 2 years ago
- GLake: optimizing GPU memory management and IO transmission.☆501Mar 24, 2025Updated last year
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆1,065Mar 12, 2026Updated 3 months ago
- 利用tensorflow/serving进行单模型、多模型、同一模型多版本的部署,并进行模型预测,并用Prothemus进行服务监控。☆11Feb 24, 2021Updated 5 years ago
- Runtimex package help to expose Go Runtime internals representation safely.☆12Feb 19, 2025Updated last year
- Demos for Flink connectors on Ververica Platform (VVP)☆44Jun 25, 2025Updated last year
- ☆12Jun 14, 2020Updated 6 years ago
- Fault-tolerant for DL frameworks☆71Jul 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Cloud Native Batch System (Project under CNCF)☆5,714Updated this week
- ☆22Jun 5, 2019Updated 7 years ago
- The source code for paper LeCo: Lightweight Compression via Learning Serial Correlations (SIGMOD'24).☆17Mar 26, 2024Updated 2 years ago
- 阿里巴巴ESMM模型解读☆44Aug 6, 2020Updated 5 years ago
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆754Oct 24, 2023Updated 2 years ago
- ☆27Sep 11, 2023Updated 2 years ago
- This project provides example FeatHub (https://github.com/alibaba/feathub) programs☆28Sep 21, 2023Updated 2 years ago