Oneflow-Inc / oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
☆8,315Updated this week
Alternatives and similar repositories for oneflow
Users that are interested in oneflow are comparing it to the libraries listed below
Sorting:
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,271Updated last year
- A primitive library for neural network☆1,336Updated 5 months ago
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,463Updated 2 months ago
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is …☆4,508Updated this week
- TVM Documentation in Chinese Simplified / TVM 中文文档☆1,236Updated 3 weeks ago
- A high performance and generic framework for distributed DNN training☆3,679Updated last year
- OneDiff: An out-of-the-box acceleration library for diffusion models.☆1,880Updated last week
- Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.☆3,364Updated 6 months ago
- MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.☆4,490Updated 9 months ago
- DLRover: An Automatic Distributed Deep Learning System☆1,435Updated this week
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆863Updated 4 months ago
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.☆3,150Updated 3 weeks ago
- 《Machine Learning Systems: Design and Implementation》- Chinese Version☆4,385Updated last year
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,633Updated last month
- Transformer related optimization, including BERT, GPT☆6,147Updated last year
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆403Updated this week
- Bagua Speeds up PyTorch☆883Updated 9 months ago
- PaddleSlim is an open-source library for deep model compression and architecture search.☆1,590Updated 5 months ago
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,521Updated last month
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆943Updated last month
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,259Updated this week
- PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.☆761Updated 2 years ago
- 百亿参数的中英文双语基座大模型☆2,432Updated last year
- Deep learning model converter for PaddlePaddle. (『飞桨』深度学习模型转换工具)☆754Updated 2 months ago
- MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.☆5,012Updated 10 months ago
- compiler learning resources collect.☆2,377Updated last month
- A highly optimized LLM inference acceleration engine for Llama and its variants.☆884Updated last week
- FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.☆3,868Updated 4 months ago
- fastllm是c++实现,后端无依赖(仅依赖CUDA,无需依赖PyTorch)的高性能大模型推理库。 可实现单4090推理DeepSeek R1 671B INT4模型,单路可达20+tps。☆3,539Updated this week
- PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)☆7,078Updated 2 weeks ago