Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies) with DLRM (Deep Learning Recommendation Model)
☆29Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for DLRM-FlexFlow
Users that are interested in DLRM-FlexFlow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jun 2, 2026Updated last month
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving☆86Sep 15, 2025Updated 9 months ago
- ☆22Apr 22, 2024Updated 2 years ago
- ☆16Feb 5, 2024Updated 2 years ago
- Execution framework for multi-task model parallelism. Enables the training of arbitrarily large models with a single GPU, with linear spe…☆21Aug 13, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Cairo lua bindings with extensions for torch☆15Jun 12, 2016Updated 10 years ago
- Global Climate Statistical Analysis Library (GCSAL) allows viewing of climate statistics formulated from over 60 years of data acquisitio…☆32Sep 14, 2021Updated 4 years ago
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training☆1,890Jun 27, 2026Updated last week
- nnScaler: Compiling DNN models for Parallel Training☆132Jun 10, 2026Updated 3 weeks ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆12Jun 28, 2025Updated last year
- ☆25Apr 3, 2023Updated 3 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- A resilient distributed training framework☆99Apr 11, 2024Updated 2 years ago
- On-disk hashtable using linear hashing☆10Nov 9, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆54Dec 11, 2022Updated 3 years ago
- ThyNVM: Transparent hybrid NonVolatile Memory (NOTE: This repo is not working yet. Please refer to the old version: https://github.com/ba…☆29Oct 21, 2017Updated 8 years ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆414Jun 22, 2026Updated last week
- ☆13May 8, 2023Updated 3 years ago
- Implementation based on OSDI paper☆20Feb 11, 2018Updated 8 years ago
- PIN-tool to produce multi-threaded atomic memory traces☆36Oct 22, 2013Updated 12 years ago
- A simulator of a memory controller designed for hybrid DRAM+NVM.☆22Dec 28, 2015Updated 10 years ago
- Linux kernel source tree with fast swap patches.☆20Nov 19, 2013Updated 12 years ago
- ☆12Feb 7, 2013Updated 13 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NVM user-space Primitives API library repository☆18Mar 12, 2014Updated 12 years ago
- Tutorial Material from the SST Team☆27Aug 5, 2025Updated 10 months ago
- Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).☆62Feb 8, 2026Updated 4 months ago
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆21May 26, 2021Updated 5 years ago
- Torch implementation for Robust convolutional neural networks under adversarial noise☆13Mar 8, 2016Updated 10 years ago
- An efficient distillation method for flow matching models☆27Feb 1, 2026Updated 5 months ago
- Saturn accelerates the training of large-scale deep learning models with a novel joint optimization approach.☆24Nov 22, 2023Updated 2 years ago
- A schedule language for large model training☆153Aug 21, 2025Updated 10 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆295Feb 23, 2024Updated 2 years ago
- New Contributor Tasks for Alluxio☆21Oct 1, 2019Updated 6 years ago
- A Torch wrapper for gSLICr super-pixel algorithm☆14Apr 1, 2016Updated 10 years ago
- Operating system demonstrating system transactions☆18Apr 19, 2017Updated 9 years ago
- ☆15Oct 3, 2023Updated 2 years ago
- In memory TPC-C implementation. Used for a number of database research projects.☆39Sep 27, 2020Updated 5 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago