Oneflow-Inc / oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
☆8,686 Updated this week
Alternatives and similar repositories for oneflow
Users who are interested in oneflow are comparing it to the libraries listed below.
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators. ☆3,165 Updated 2 weeks ago
- A high performance and generic framework for distributed DNN training ☆3,681 Updated last year
- LightSeq: A High Performance Library for Sequence Processing and Generation ☆3,277 Updated 2 years ago
- TVM Documentation in Chinese Simplified / TVM 中文文档 ☆1,478 Updated last month
- OneDiff: An out-of-the-box acceleration library for diffusion models. ☆1,887 Updated 3 weeks ago
- MegEngine is a fast, scalable, easy-to-use deep learning framework with support for automatic differentiation ☆4,794 Updated 7 months ago
- Tengine is a lite, high performance, modular inference engine for embedded devices ☆4,468 Updated 2 months ago
- DLRover: An Automatic Distributed Deep Learning System ☆1,474 Updated this week
- A primitive library for neural network ☆1,343 Updated 6 months ago
- Transformer related optimization, including BERT, GPT ☆6,173 Updated last year
- MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios. ☆4,516 Updated 10 months ago
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is … ☆4,522 Updated 3 weeks ago
- A Chinese-English bilingual base large language model with tens of billions of parameters ☆2,434 Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. ☆1,522 Updated last month
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆871 Updated 5 months ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆11,655 Updated last week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆12,319 Updated this week
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. ☆3,350 Updated this week
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆406 Updated 2 weeks ago
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism ☆1,978 Updated last week
- A highly optimized LLM inference acceleration engine for Llama and its variants. ☆890 Updated 2 weeks ago
- DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, Aligned… ☆3,061 Updated last year
- This is a Chinese translation of the CUDA programming guide ☆1,550 Updated 6 months ago
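- cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform, supporting SSO login, big-data platform integration, online notebook development, drag-and-drop pipeline orchestration of task flows, multi-node multi-GPU distributed training, hyperparameter search, inference services, VGPU, edge computing, an annotation platform, automated annotation, large-model fine-tuning, vllm large-model … ☆1,674 Updated last month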
- oneAPI Deep Neural Network Library (oneDNN) ☆3,796 Updated this week
- ☆607 Updated 11 months ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,762 Updated this week
- Serve, optimize and scale PyTorch models in production ☆4,330 Updated this week
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo. ☆7,370 Updated 6 months ago
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision. ☆2,509 Updated last week