liguodongiot / netron-flask
A Flask version of the model visualization tool Netron
☆18Updated 2 years ago
Alternatives and similar repositories for netron-flask:
Users who are interested in netron-flask are comparing it to the libraries listed below
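- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆41Updated last year
- unify-easy-llm (ULM) aims to be a simple one-click training tool for large models, supporting different hardware such as Nvidia GPU and Ascend NPU as well as commonly used large models.☆55Updated 6 months ago
- ☢️ TensorRT 2023 second round: inference acceleration and optimization of the Llama model based on TensorRT-LLM☆44Updated last year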
- OneFlow->ONNX☆42Updated last year
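- An autoTVM demonstration of searching for optimized neural network inference code: compiles the open-source model centerface with tvm, uses autoTVM to search for the best inference code, and finally deploys it compiled as C++ code. The demo platform is cuda, but it can be other platforms, such as Raspberry Pi, Android phones, or iPhones. This is a demonstration of …☆27Updated 3 years ago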
- run ChatGLM2-6B in BM1684X☆49Updated 11 months ago
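- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Contest: third-place solution in the preliminary round☆48Updated last year
- PaddlePaddle custom device implementation. (PaddlePaddle custom hardware integration implementation)☆80Updated this week
- Editor for the ncnn and pnnx formats☆130Updated 4 months ago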
- Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm☆11Updated 3 years ago
- Paddle Automatically Diff Precision Toolkits.☆49Updated 10 months ago
- PaddlePaddle Developer Community☆97Updated this week
- deploy onnx models with TensorRT and LibTorch☆17Updated 3 years ago
- Large model deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM☆26Updated 11 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated 8 months ago
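- A high-performance, highly extensible, easy-to-use framework for AI applications. Provides AI application developers with a unified, high-performance, easy-to-use programming framework for quickly building AI industry applications across device, edge, and cloud on top of full-stack AI services; supports GPU,…☆145Updated 8 months ago
- Triton documentation in Simplified Chinese☆54Updated last month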
- A light llama-like llm inference framework based on the triton kernel.☆88Updated this week
- Translates networks from different platforms to an Intermediate Representation (IR)☆44Updated 6 years ago
- Music large model based on InternLM2-chat.☆22Updated last month
- ☆33Updated last year
- Deploys PicoDet object detection with ONNXRuntime, with programs in both C++ and Python versions☆28Updated 2 years ago
- Models and examples built with OneFlow☆96Updated 4 months ago
- A beginner's introductory tutorial on model compression☆22Updated 7 months ago
- TenniS: Tensor based Edge Neural Network Inference System☆13Updated 11 months ago
- Transformer related optimization, including BERT, GPT☆17Updated last year
- A llama model inference framework implemented in CUDA C++☆45Updated 3 months ago
- ☆19Updated 3 years ago
- ☆14Updated 10 months ago