PersiaML / PERSIA
High performance distributed framework for training deep learning recommendation models based on PyTorch.
☆394Updated this week
Related projects: ⓘ
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆933Updated 2 weeks ago
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆152Updated 4 months ago
- Bagua Speeds up PyTorch☆872Updated last month
- Large batch training of CTR models based on DeepCTR with CowClip.☆162Updated last year
- Running BERT without Padding☆455Updated 2 years ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆260Updated last year
- ☆205Updated last year
- ☆50Updated 11 months ago
- PyTorch On Angel, arming PyTorch with a powerful Parameter Server, which enable PyTorch to train very big models.☆164Updated last year
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆126Updated 3 weeks ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆192Updated 2 years ago
- DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…☆1,024Updated 2 months ago
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆41Updated 11 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆251Updated 9 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,035Updated 2 weeks ago
- A tensor-aware point-to-point communication primitive for machine learning☆247Updated last year
- ☆315Updated this week
- Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Gene…☆647Updated this week
- ☆375Updated last year
- deepx_core是一个专注于张量计算/深度学习的基础库☆370Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆275Updated 2 years ago
- Dive into Deep Learning Compiler☆640Updated 2 years ago
- Simple Distributed Deep Learning on TensorFlow☆134Updated last year
- ☆374Updated 6 years ago
- Read and write Tensorflow TFRecord data from Apache Spark.☆285Updated 4 months ago
- A flexible, high-performance framework for large-scale retrieval problems based on TensorFlow.☆147Updated 2 months ago
- embedx 是基于 c++ 开发的、完全自研的分布式 embedding 训练和推理框架。它目前支持 图模型、深度排序、召回模型和图与排序、图与召回的联合训练模型等☆297Updated 3 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆324Updated last week
- PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.☆288Updated last year
- ☆562Updated 6 years ago