liguodongiot / netron-flask
A Flask version of the model visualization tool netron
☆18Updated 2 years ago
Alternatives and similar repositories for netron-flask
Users that are interested in netron-flask are comparing it to the libraries listed below
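- A high-performance, highly extensible, easy-to-use framework for AI applications. It provides AI application developers with a unified, high-performance, easy-to-use programming framework for quickly building cross-device, edge, and cloud AI industry applications on top of full-stack AI services, with support for GPU,…☆156Updated last year
- A cross-platform containerized Linux desktop environment☆70Updated 4 months ago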
- run ChatGLM2-6B in BM1684X☆49Updated last year
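- NVIDIA TensorRT Hackathon 2023 second-round project: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆42Updated last year
- An editor for the ncnn and pnnx formats☆134Updated 9 months ago
- Comparison of LLM API performance metrics: an in-depth analysis of key metrics such as TTFT and TPS☆18Updated 10 months ago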
- Music large model based on InternLM2-chat.☆22Updated 6 months ago
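- 🔨🔨🔨 Tool for making model training datasets☆19Updated 8 months ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Competition: third-place solution in the preliminary round☆49Updated last year
- ☢️ TensorRT 2023 second round: accelerating Llama model inference with TensorRT-LLM☆49Updated last year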
- Datasets, Transforms and Models specific to Computer Vision☆87Updated last year
- run chatglm3-6b in BM1684X☆39Updated last year
- Machine learning fundamentals☆8Updated 6 years ago
- LLM deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- Snapdragon Neural Processing Engine (SNPE) SDK. The Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆34Updated 3 years ago
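- A demonstration of autoTVM optimization search for neural network inference code: it compiles the open-source model centerface with tvm, uses autoTVM to search for the best inference code, and finally deploys the result compiled to C++ code. The demo platform is CUDA, but other platforms are possible, such as Raspberry Pi, Android phones, and iPhones. This is a demonstration of …☆27Updated 4 years ago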
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆13Updated 2 years ago
- PaddlePaddle Developer Community☆117Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated last year
- 💡💡💡 awesome computer vision app in gradio☆53Updated last year
- a simple lightweight large language model pipeline framework.☆25Updated 2 months ago
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆85Updated last year
- A simple forward-inference framework built by taking MNN apart (for study!)☆23Updated 4 years ago
- deploy onnx models with TensorRT and LibTorch☆17Updated 3 years ago
- DeepSparkHub selects hundreds of application algorithms and models, covering various fields of AI and general-purpose computing, to suppo…☆64Updated 2 weeks ago
- Paddle Automatically Diff Precision Toolkits.☆49Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- Tiny C++11 GPT-2 inference implementation from scratch☆62Updated this week
- OneFlow->ONNX☆43Updated 2 years ago
- Efficient inference of large language models.☆149Updated last month