4paradigm / OpenEmbeddingLinks

OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.

☆33

Alternatives and similar repositories for OpenEmbedding

Users that are interested in OpenEmbedding are comparing it to the libraries listed below

Sorting:

DeepRec-AI / HybridBackend
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
☆158Updated last year
triton-inference-server / hugectr_backend
☆55Updated last year
NVIDIA-Merlin / HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…
☆163Updated this week
alibaba / TePDist
TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.
☆94Updated 2 years ago
DeepRec-AI / serving
A high-performance serving system for DeepRec based on TensorFlow Serving.
☆19Updated last year
bytedance / ps-lite
A lightweight parameter server interface
☆80Updated 2 years ago
ray-project / mobius
Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.
☆98Updated last year
alibaba / EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
☆267Updated 2 years ago
quiver-team / quiver-feature
High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph
☆54Updated 3 years ago
kubedl-io / morphling
Automatic tuning for ML model deployment on Kubernetes
☆80Updated 9 months ago
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆210Updated 4 years ago
tensorflow / networking
Enhanced networking support for TensorFlow. Maintained by SIG-networking.
☆98Updated 3 years ago
Qihoo360 / dgl-operator
The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes
☆44Updated 3 years ago
decis-bench / febench
A Benchmark for Real-Time Relational Data Feature Extraction (VLDB'23 Best Industry Paper Runnerup)
☆51Updated last year
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆143Updated last week
byteps / examples
BytePS examples (Vision, NLP, GAN, etc)
☆19Updated 2 years ago
kleveross / ftlib
Fault-tolerant for DL frameworks
☆70Updated 2 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127Updated 3 years ago
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆118Updated last year
KnightKingWalk / KnightKing
A general-purpose, distributed graph random walk engine.
☆109Updated last year
4paradigm / pafka
Pafka is originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for …
☆67Updated 3 years ago
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆60Updated last year
elasticdeeplearning / edl
Elastic Deep Learning for deep learning framework on Kubernetes
☆174Updated 2 years ago
Funatiq / gossip
gossip: Efficient Communication Primitives for Multi-GPU Systems
☆59Updated 3 years ago
alexrenz / AdaPM
A fully adaptive, zero-tuning parameter manager that enables efficient distributed machine learning training
☆20Updated 2 years ago
bytedance / primus
☆217Updated 2 years ago
GHGmc2 / awesome-ml-infra
Building Machine Learning Infrastructure!
☆44Updated 6 years ago
4paradigm / pmemstore
Key/Value Datastore for Persistent Memory
☆27Updated 4 years ago
uwsampl / nexus
☆82Updated last month
Oneflow-Inc / DLPerf
DeepLearning Framework Performance Profiling Toolkit
☆285Updated 3 years ago