DeepSparkInference has selected 216 inference models of both small and large sizes. The small models cover fields such as computer vision, natural language processing, and speech recognition; the LLMs involve various frameworks including vLLM, TGI and LMDeploy. This repository is the mirror of Gitee.
☆28Mar 25, 2026Updated this week
Alternatives and similar repositories for DeepSparkInference
Users that are interested in DeepSparkInference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The DeepSpark open platform selects hundreds of open source application algorithms and models that are deeply coupled with industrial app…☆47Updated this week
- This repository contains the Open Source Software components of the Iluvatar Corex IxRT. It includes the sources for IxRT plugins and dep…☆17Mar 19, 2026Updated last week
- DeepSparkHub selects hundreds of application algorithms and models, covering various fields of AI and general-purpose computing, to suppo…☆70Updated this week
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 3 years ago
- ☆34Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆13Sep 25, 2020Updated 5 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆16Oct 11, 2024Updated last year
- Third-party toolkit for Rope3D dataset☆13Jun 13, 2022Updated 3 years ago
- GEMM and Winograd based convolutions using CUTLASS☆28Jul 15, 2020Updated 5 years ago
- ☆16Aug 18, 2015Updated 10 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Oct 20, 2023Updated 2 years ago
- ☆10Apr 8, 2022Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.☆15Aug 9, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆18Jan 4, 2024Updated 2 years ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Feb 8, 2026Updated last month
- Embed Python in Unreal Engine 4☆11Aug 13, 2021Updated 4 years ago
- Implementation of the Spatio-Temporal Hierarchical Matching Pursuit (ST-HMP) descriptor presented in the paper: M. Madry, L. Bo, D. Kragi…☆14Aug 4, 2014Updated 11 years ago
- ☆24Oct 16, 2025Updated 5 months ago
- Benchmark of TVM quantized model on CUDA☆112Jun 19, 2020Updated 5 years ago
- Software Engineer / Indie Hacker☆13Updated this week
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- Claude Code Skill:内容生产线☆63Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for the paper, From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process☆24Sep 9, 2024Updated last year
- Observe workers as they pass in front of a camera to determine if they have adequate safety protection.☆10Jan 3, 2023Updated 3 years ago
- Optimum version of a UI for Stable Diffusion, running on ONNX models for faster inference, working on most common GPU vendors: NVIDIA,AMD…☆26Jan 5, 2024Updated 2 years ago
- Transparent Cudnn / Cublas / Eigen usage for the deep learning training using MNIST dataset.☆18Sep 3, 2020Updated 5 years ago
- Official PyTorch implementation of "The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignm…☆20Dec 9, 2024Updated last year
- ☆10Feb 26, 2020Updated 6 years ago
- 记 录各种工具的使用方法,包括并不限于 Git、Mac、WebStorm、Atom、VS Code、Nginx☆10Jul 20, 2018Updated 7 years ago
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Apr 1, 2023Updated 2 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Nov 24, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Fody AddIn that allows declarative padding structures and classes to fight the false sharing problem.☆12May 24, 2016Updated 9 years ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated last year
- TensorRT encapsulation, learn, rewrite, practice.☆29Oct 19, 2022Updated 3 years ago
- Elasticsearch provider for Examine in Umbraco v8☆12Jan 15, 2024Updated 2 years ago
- ☆16Jul 5, 2019Updated 6 years ago
- Single Image Deraining Toolbox and Benchmark☆18May 30, 2023Updated 2 years ago
- Easily add searching to your apps☆14Mar 23, 2026Updated last week