Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies) with DLRM (Deep Learning Recommendation Model)
☆29Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for DLRM-FlexFlow
Users that are interested in DLRM-FlexFlow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jun 2, 2026Updated last week
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving☆86Sep 15, 2025Updated 8 months ago
- ☆23Apr 22, 2024Updated 2 years ago
- ☆16Feb 5, 2024Updated 2 years ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆193Jul 20, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Execution framework for multi-task model parallelism. Enables the training of arbitrarily large models with a single GPU, with linear spe…☆21Aug 13, 2023Updated 2 years ago
- ☆24Updated this week
- Cairo lua bindings with extensions for torch☆15Jun 12, 2016Updated 10 years ago
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training☆1,888Jun 6, 2026Updated last week
- nnScaler: Compiling DNN models for Parallel Training☆132Updated this week
- TensorFlow Runtime Tracing Metadata Visualization☆10Sep 11, 2018Updated 7 years ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆12Jun 28, 2025Updated 11 months ago
- ☆25Apr 3, 2023Updated 3 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of a Tensorflow XLA rematerialization pass☆15Dec 20, 2019Updated 6 years ago
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.☆12Jan 15, 2020Updated 6 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆55Dec 11, 2022Updated 3 years ago
- Generate custom Mac OS folder icons with a desired image as stamp☆12Oct 3, 2023Updated 2 years ago
- Lecture notes of cs294-2017Fall☆10Feb 28, 2018Updated 8 years ago
- ThyNVM: Transparent hybrid NonVolatile Memory (NOTE: This repo is not working yet. Please refer to the old version: https://github.com/ba…☆29Oct 21, 2017Updated 8 years ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆412Jun 6, 2026Updated last week
- ☆13May 8, 2023Updated 3 years ago
- Implementation based on OSDI paper☆20Feb 11, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Distributed Deep Learning Benchmark Suite☆11Oct 31, 2022Updated 3 years ago
- PIN-tool to produce multi-threaded atomic memory traces☆36Oct 22, 2013Updated 12 years ago
- A simulator of a memory controller designed for hybrid DRAM+NVM.☆22Dec 28, 2015Updated 10 years ago
- Latex resume template☆12Mar 29, 2012Updated 14 years ago
- An OpenCV Android Camera Sudoku Solver☆11Apr 7, 2017Updated 9 years ago
- Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).☆60Feb 8, 2026Updated 4 months ago
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆20May 26, 2021Updated 5 years ago
- An efficient distillation method for flow matching models☆26Feb 1, 2026Updated 4 months ago
- An FPGA-based NetTLP adapter☆29Mar 10, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Learning a game engine by example.☆10Feb 8, 2016Updated 10 years ago
- A schedule language for large model training☆153Aug 21, 2025Updated 9 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆295Feb 23, 2024Updated 2 years ago
- Control AWS EC2 instances using only their name (forget about Instance-Id's forever). Examples: aws-ssh NAME (connect via ssh), aws-list…☆10Jan 25, 2018Updated 8 years ago
- ☆15Oct 3, 2023Updated 2 years ago
- Operating system demonstrating system transactions☆18Apr 19, 2017Updated 9 years ago