Ascend / ModelZoo-PyTorch
☆61Updated last year
Alternatives and similar repositories for ModelZoo-PyTorch:
Users that are interested in ModelZoo-PyTorch are comparing it to the libraries listed below
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- Pytorch分布式训练框架☆79Updated 3 weeks ago
- async inference for machine learning model☆26Updated 2 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆46Updated last year
- Compare multiple optimization methods on triton to imporve model service performance☆50Updated last year
- ☆41Updated 5 months ago
- ☆22Updated 8 months ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆49Updated last year
- 视觉训练框架(简单 / 模块化 / 高扩展 / 分布式 / 自动剪枝)☆30Updated 7 months ago
- README.md☆47Updated last year
- ☆24Updated last year
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated 2 years ago
- 高效部署:YOLO X, V3, V4, V5, V6, V7, V8, EdgeYOLO TRT推理 ™️ ,前后处理均由CUDA核函数实现 CPP/CUDA🚀☆49Updated 2 years ago
- ☆120Updated last year
- Building a VLM model starts from the basic module.☆14Updated last year
- 多模态 MM +Chat 合集☆255Updated 2 months ago
- Trans different platform's network to International Representation(IR)☆44Updated 6 years ago
- ☆25Updated 4 months ago
- YOLOv5 Quantization Aware Training with TensorRT☆15Updated 2 years ago
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- https://zhuanlan.zhihu.com/p/396448133☆41Updated 3 years ago
- 使用ONNXRuntime部署阿里达摩院开源DAMO-YOLO目标检测,一共包含27个onnx模型,依然是包含了C++和Python两个版本的程序☆31Updated 2 years ago
- 手摸手 美团 YOLOv6模型训练和TensorRT端到端部署方案教程☆30Updated 2 years ago
- ☆99Updated 3 years ago
- TensorRT简明教程☆26Updated 3 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆35Updated 2 years ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆20Updated 2 months ago
- LLM Tokenizer with BPE algorithm☆31Updated 11 months ago
- ☆22Updated last year
- An onnx-based quantitation tool.☆71Updated last year