Oneflow-Inc / models
Models and examples built with OneFlow
☆97Updated 6 months ago
Alternatives and similar repositories for models:
Users that are interested in models are comparing it to the libraries listed below
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆401Updated this week
- OneFlow models for benchmarking.☆104Updated 8 months ago
- ☆78Updated last year
- Simple Dynamic Batching Inference☆145Updated 3 years ago
- oneflow documentation☆68Updated 9 months ago
- Datasets, Transforms and Models specific to Computer Vision☆85Updated last year
- ☆214Updated last year
- ☆127Updated 3 months ago
- ☆139Updated 11 months ago
- DeepLearning Framework Performance Profiling Toolkit☆285Updated 3 years ago
- Transformer related optimization, including BERT, GPT☆59Updated last year
- export llama to onnx☆121Updated 3 months ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆472Updated last year
- ☆48Updated this week
- OneFlow->ONNX☆43Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆17Updated last year
- ☆99Updated 3 years ago
- ☆58Updated 5 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆242Updated this week
- Transformer related optimization, including BERT, GPT☆39Updated 2 years ago
- A brief of TorchScript by MNIST☆110Updated 2 years ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆130Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆46Updated last year
- ☆71Updated 2 years ago
- Compiler Infrastructure for Neural Networks☆145Updated last year
- ☆90Updated last year
- ☆131Updated last month
- A Tight-fisted Optimizer☆47Updated 2 years ago
- simplify >2GB large onnx model☆55Updated 4 months ago
- Running BERT without Padding☆471Updated 3 years ago