Oneflow-Inc/OneFlow-Benchmark

OneFlow models for benchmarking.

☆103

Alternatives and similar repositories for OneFlow-Benchmark

Users that are interested in OneFlow-Benchmark are comparing it to the libraries listed below.

Oneflow-Inc / DLPerf
DeepLearning Framework Performance Profiling Toolkit
☆292 · Mar 28, 2022 · Updated 4 years ago
Oneflow-Inc / oneflow-documentation
oneflow documentation
☆69 · Jun 26, 2024 · Updated 2 years ago
SkyworkAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆17 · Jun 3, 2024 · Updated 2 years ago
Oneflow-Inc / conda-env
☆12 · Mar 13, 2023 · Updated 3 years ago
Oneflow-Inc / libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
☆403 · Jul 31, 2025 · Updated 11 months ago
Oneflow-Inc / models
Models and examples built with OneFlow
☆100 · Oct 16, 2024 · Updated last year
microsoft / Delayed-Compensation-Asynchronous-Stochastic-Gradient-Descent-for-Multiverso
Asynchronous Stochastic Gradient Descent with Delay Compensation
☆22 · Jun 9, 2017 · Updated 9 years ago
wyg1997 / neovimplus
auto deploy neovim like chxuan/vimplus
☆12 · Apr 22, 2025 · Updated last year
lirundong / quant-pack
[Archived Project] Codebase for network quantization study.
☆12 · May 20, 2020 · Updated 6 years ago
HKBU-HPML / ddl-benchmarks
ddl-benchmarks: Benchmarks for Distributed Deep Learning
☆36 · May 29, 2020 · Updated 6 years ago
DTennant / tl-YOLOv2
A tensorlayer implementation of YOLOv2: Object Detection for both image and video!
☆12 · Jun 20, 2018 · Updated 8 years ago
Oneflow-Inc / oneflow_convert
OneFlow->ONNX
☆42 · Apr 19, 2023 · Updated 3 years ago
Oneflow-Inc / one-fx
A toolkit for developers to simplify the transformation of nn.Module instances. It currently corresponds to Pytorch.fx.
☆13 · Apr 7, 2023 · Updated 3 years ago
uwsampl / dtr-prototype
Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616
☆133 · Jul 6, 2023 · Updated 3 years ago
feifeibear / PyTorchMemTracer
Depict GPU memory footprint during DNN training of PyTorch
☆11 · Nov 17, 2022 · Updated 3 years ago
Adlik / model_zoo
☆11 · Dec 26, 2025 · Updated 6 months ago
bytedance / byteps
A high performance and generic framework for distributed DNN training
☆3,717 · Oct 3, 2023 · Updated 2 years ago
hogepodge / tvm-docker
A basic Docker-based installation of TVM
☆11 · Jun 23, 2022 · Updated 4 years ago
microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002 · Sep 19, 2024 · Updated last year
epfml / LocalSGD-Code
☆46 · Mar 4, 2020 · Updated 6 years ago
houkensjtu / taichi-hackathon-akinasan
Akinasan team (秋名山车队)'s code base for the 0th Taichi Hackathon.
☆19 · Dec 4, 2022 · Updated 3 years ago
CSshengxy / MEC
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial)
☆17 · Apr 9, 2019 · Updated 7 years ago
Jittor / jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
☆3,227 · Jul 13, 2026 · Updated last week
Oneflow-Inc / oneflow-lite
☆17 · Jan 1, 2024 · Updated 2 years ago
carefree0910 / carefree-flow
Deep Learning ❤️ OneFlow
☆19 · Aug 26, 2021 · Updated 4 years ago
magruener / reconstructing-proprietary-video-streaming-algorithms
This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"
☆14 · Mar 24, 2021 · Updated 5 years ago
HydraQYH / hp_rms_norm
High-performance RMSNorm implementation using SM core storage (registers and shared memory)
☆30 · Jan 22, 2026 · Updated 5 months ago
NVlabs / iccv2019-mixed-precision-tutorial
☆28 · Jul 20, 2020 · Updated 6 years ago
PhilJd / contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
☆294 · Oct 29, 2020 · Updated 5 years ago
KuangjuX / cuda-evolve-oss
Autonomous GPU kernel optimization system driven by AI agents.
☆31 · Mar 29, 2026 · Updated 3 months ago
bytedance / ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479 · Mar 15, 2024 · Updated 2 years ago
TiledTensor / TiledBench
Benchmark tests supporting the TiledCUDA library.
☆19 · Nov 19, 2024 · Updated last year
xuqifan897 / Optimus
☆28 · Jul 11, 2021 · Updated 5 years ago
BaguaSys / bagua-core
Core communication lib for Bagua.
☆48 · Sep 15, 2021 · Updated 4 years ago
zphang / adaptive-computation-time-pytorch
Alex Graves' Adaptive Computation Time in PyTorch
☆14 · Jan 9, 2018 · Updated 8 years ago
GuanhuaWang / sensAI
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
☆65 · Jul 25, 2024 · Updated last year
d2l-ai / d2l-tvm
Dive into Deep Learning Compiler
☆649 · Jun 19, 2022 · Updated 4 years ago
alibaba / FastNN
FastNN provides distributed training examples that use EPL.
☆85 · Mar 11, 2022 · Updated 4 years ago
bytedance / QSync
Official repository for "IPDPS'24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20 · Feb 23, 2024 · Updated 2 years ago