Deep-Spark / DeepSparkInference
DeepSparkInference has selected 48 inference model examples, covering fields such as computer vision, natural language processing, and speech recognition. Subsequent phases will gradually expand to more AI fields.
☆15Updated this week
Alternatives and similar repositories for DeepSparkInference:
Users that are interested in DeepSparkInference are comparing it to the libraries listed below
- The DeepSpark open platform selects hundreds of open source application algorithms and models that are deeply coupled with industrial app…☆42Updated last month
- DeepSparkHub selects hundreds of application algorithms and models, covering various fields of AI and general-purpose computing, to suppo…☆59Updated this week
- ☆72Updated 2 years ago
- simplify >2GB large onnx model☆52Updated 2 months ago
- This repository contains the Open Source Software components of the Iluvatar Corex IxRT. It includes the sources for IxRT plugins and dep…☆14Updated 5 months ago
- ☆57Updated 3 months ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆44Updated last year
- ☆11Updated last year
- ☆35Updated 4 months ago
- TensorRT 2022复赛方案: 首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化☆138Updated 2 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆27Updated 3 years ago
- ☆140Updated 9 months ago
- ☆95Updated 3 years ago
- ☆23Updated last year
- MegEngine到其他框架的转换器☆69Updated last year
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆20Updated 11 months ago
- ☆142Updated last month
- TensorRT encapsulation, learn, rewrite, practice.☆28Updated 2 years ago
- ☆127Updated last month
- 动手学习TVM核心原理教程☆59Updated 4 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆41Updated last year
- heterogeneity-aware-lowering-and-optimization☆254Updated last year
- Transformer related optimization, including BERT, GPT☆17Updated last year
- ☆124Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆153Updated 9 months ago
- A breakdown of NCNN☆46Updated 4 years ago
- symmetric int8 gemm☆66Updated 4 years ago
- ☆69Updated last year
- OpenPose uses Pytorch for static quantization, saving, and loading of models☆82Updated 3 years ago
- Serving Inside Pytorch☆154Updated this week