liguodongiot / netron-flask
A Flask version of netron, the model visualization tool
☆18 Updated 2 years ago
Alternatives and similar repositories for netron-flask:
Users that are interested in netron-flask are comparing it to the libraries listed below
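- Fine-tuning a multimodal large model for X-ray recognition based on qwenvl ☆14 Updated 5 months ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM ☆41 Updated last year
- LLM API performance metrics comparison: an in-depth analysis of key metrics such as TTFT and TPS ☆16 Updated 6 months ago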
- OneFlow->ONNX ☆42 Updated last year
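- LLM deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM ☆26 Updated last year
- AI developer platform. The goal is to build a platform that captures video and images, calls APIs for intelligent data annotation, and runs automated testing once training is complete. ☆29 Updated 7 years ago
- Baidu QA dataset with 1 million entries ☆47 Updated last year
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Competition: third-place solution in the preliminary round ☆49 Updated last year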
- Music large model based on InternLM2-chat. ☆22 Updated 3 months ago
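- ☢️ TensorRT 2023 second round: Llama model inference acceleration and optimization based on TensorRT-LLM ☆46 Updated last year
- Demonstration of autoTVM search for optimized neural network inference code: compiles the open-source model centerface with tvm, uses autoTVM to search for the optimal inference code, and finally deploys by compiling to C++ code. The demo platform is cuda, but other platforms are possible, such as Raspberry Pi, Android phones, and iPhones. This is a demonstration of … ☆27 Updated 3 years ago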
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆16 Updated 9 months ago
- Run ChatGLM2-6B in BM1684X ☆49 Updated last year
- A simple forward inference framework derived by taking apart MNN (for study!) ☆22 Updated 4 years ago
- Transformer related optimization, including BERT, GPT ☆17 Updated last year
- ☆24 Updated 2 years ago
- Translate different platforms' networks to an Intermediate Representation (IR) ☆44 Updated 6 years ago
- OneFlow Serving ☆20 Updated 3 months ago
- A more efficient GLM implementation! ☆55 Updated 2 years ago
- Datasets, Transforms and Models specific to Computer Vision ☆85 Updated last year
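- unify-easy-llm (ULM) aims to be a simple one-click large model training tool, supporting different hardware such as Nvidia GPU and Ascend NPU as well as commonly used large models. ☆55 Updated 8 months ago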
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang ☆44 Updated 4 months ago
- PaddlePaddle Developer Community ☆101 Updated this week
- ☆124 Updated last year
- Models and examples built with OneFlow ☆97 Updated 5 months ago
- Deploy onnx models with TensorRT and LibTorch ☆17 Updated 3 years ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆28 Updated 10 months ago
- A beginner's introductory tutorial on model compression ☆22 Updated 8 months ago
- A concise TensorRT tutorial ☆26 Updated 3 years ago
- ☕️ A vscode extension for netron, supporting *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc. ☆13 Updated last year