Development repository for integrating FlexFlow (a distributed deep learning framework that supports flexible parallelization strategies) with DLRM (Deep Learning Recommendation Model).
☆29 · Oct 12, 2021 · Updated 4 years ago
Alternatives and similar repositories for DLRM-FlexFlow
Users that are interested in DLRM-FlexFlow are comparing it to the libraries listed below.
- ☆12 · Sep 16, 2025 · Updated 6 months ago
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving ☆75 · Sep 15, 2025 · Updated 6 months ago
- ☆22 · Apr 22, 2024 · Updated last year
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as … ☆194 · Jul 20, 2022 · Updated 3 years ago
- Cairo lua bindings with extensions for torch ☆15 · Jun 12, 2016 · Updated 9 years ago
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training ☆1,864 · Updated this week
- nnScaler: Compiling DNN models for Parallel Training ☆126 · Sep 23, 2025 · Updated 6 months ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading ☆13 · Jun 28, 2025 · Updated 8 months ago
- ☆25 · Apr 3, 2023 · Updated 2 years ago
- Implementation of a Tensorflow XLA rematerialization pass ☆15 · Dec 20, 2019 · Updated 6 years ago
- An Attention Superoptimizer ☆22 · Jan 20, 2025 · Updated last year
- Android face detection: android.media, play service, Face++ ☆11 · Aug 13, 2016 · Updated 9 years ago
- ☆15 · Sep 13, 2025 · Updated 6 months ago
- A resilient distributed training framework ☆97 · Apr 11, 2024 · Updated last year
- Generate custom Mac OS folder icons with a desired image as stamp ☆12 · Oct 3, 2023 · Updated 2 years ago
- ThyNVM: Transparent hybrid NonVolatile Memory (NOTE: This repo is not working yet. Please refer to the old version: https://github.com/ba… ☆29 · Oct 21, 2017 · Updated 8 years ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch. ☆411 · Jun 14, 2025 · Updated 9 months ago
- ☆13 · May 8, 2023 · Updated 2 years ago
- Implementation based on OSDI paper ☆20 · Feb 11, 2018 · Updated 8 years ago
- Distributed Deep Learning Benchmark Suite ☆11 · Oct 31, 2022 · Updated 3 years ago
- PIN-tool to produce multi-threaded atomic memory traces ☆37 · Oct 22, 2013 · Updated 12 years ago
- A simulator of a memory controller designed for hybrid DRAM+NVM. ☆22 · Dec 28, 2015 · Updated 10 years ago
- Linux kernel source tree with fast swap patches. ☆20 · Nov 19, 2013 · Updated 12 years ago
- An OpenCV Android Camera Sudoku Solver ☆11 · Apr 7, 2017 · Updated 8 years ago
- NVM user-space Primitives API library repository ☆18 · Mar 12, 2014 · Updated 12 years ago
- Tutorial Material from the SST Team ☆25 · Aug 5, 2025 · Updated 7 months ago
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions ☆19 · May 26, 2021 · Updated 4 years ago
- Torch implementation for Robust convolutional neural networks under adversarial noise ☆13 · Mar 8, 2016 · Updated 10 years ago
- Fine-Grained Distributed Computing ☆11 · Feb 15, 2016 · Updated 10 years ago
- An FPGA-based NetTLP adapter ☆27 · Mar 10, 2020 · Updated 6 years ago
- LLM-powered Python ☆15 · Updated this week
- A schedule language for large model training ☆152 · Aug 21, 2025 · Updated 7 months ago
- Transactional memory (mostly Intel® TSX) experiments ☆14 · May 3, 2014 · Updated 11 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation ☆30 · Dec 22, 2025 · Updated 3 months ago
- New Contributor Tasks for Alluxio ☆20 · Oct 1, 2019 · Updated 6 years ago
- Control AWS EC2 instances using only their name (forget about Instance-Id's forever). Examples: aws-ssh NAME (connect via ssh), aws-list… ☆10 · Jan 25, 2018 · Updated 8 years ago
- A Torch wrapper for gSLICr super-pixel algorithm ☆14 · Apr 1, 2016 · Updated 9 years ago
- ☆15 · Oct 3, 2023 · Updated 2 years ago
- Operating system demonstrating system transactions ☆17 · Apr 19, 2017 · Updated 8 years ago