Transformer framework for edge computing based on C++.
☆129Nov 11, 2024Updated last year
Alternatives and similar repositories for vt-transformer
Users that are interested in vt-transformer are comparing it to the libraries listed below.
- ☆28Jun 30, 2025Updated 9 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- ggml study notes; ggml is an inference framework for machine learning☆18Mar 24, 2024Updated 2 years ago
- ☆33Jul 23, 2024Updated last year
- ☆10Jul 18, 2024Updated last year
- LLM deployment project based on ONNX.☆50Oct 9, 2024Updated last year
- ☆17Jan 1, 2024Updated 2 years ago
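- DETR tensor: removes auxiliary heads that are unused during inference + fp16 deployment for further speedup + a new method for fixing all-zero outputs after conversion to TensorRT.☆11Jan 9, 2024Updated 2 years ago
- yolov8 rotated object detection deployment: Rockchip RKNN chips, Horizon chips, and TensorRT☆28Jun 4, 2024Updated last year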
- ☆33Feb 3, 2025Updated last year
- Forward and backward Attention DNN operators implemented with LibTorch, cuDNN, and Eigen.☆31Jun 6, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- Translating Torch models to other frameworks such as Caffe, MxNet ...☆22Dec 16, 2016Updated 9 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- This repository deploys the Nanodet detection algorithm under the OpenVINO inference framework, with rewritten pre-processing and post-processing for very high performance, speeding up detection on Intel CPU platforms. The model is also quantized (PTQ) to int8 precision with the NNCF and PPQ tools for even faster inference.☆16Jun 14, 2023Updated 2 years ago
- Efficient inference of large language models.☆150Sep 28, 2025Updated 6 months ago
- ☆19Aug 23, 2022Updated 3 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 4 months ago
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- Introduction to the Vitis accelerator deployment workflow☆13Jan 10, 2025Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- ☆25Aug 27, 2021Updated 4 years ago
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- C++ implementation of mmpose inference for pose estimation, based on MNN☆12Mar 9, 2021Updated 5 years ago
- ☆31May 1, 2022Updated 3 years ago
- Training camp lecture notes☆20Jan 21, 2025Updated last year
- This repo consists of some experimental results on the bdd100k dataset using different object detection algorithms (Faster-RCNN, FCOS, ATSS)☆11Jun 27, 2020Updated 5 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated last year
- RKNN inference framework deployment for Rockchip chips (yolo models)☆14Jul 17, 2025Updated 8 months ago
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆56Mar 8, 2026Updated last month
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- Code for ECCV 2022 paper "ColorFormer: Image Colorization via Color Memory assisted Hybrid-attention Transformer"☆12Jan 30, 2023Updated 3 years ago
- Comparison of LLM API performance metrics - in-depth analysis of key metrics such as TTFT and TPS☆20Sep 12, 2024Updated last year
- run ChatGLM2-6B in BM1684X☆49Mar 1, 2024Updated 2 years ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- A simple Transformer model implemented in C++. Attention Is All You Need.☆53Mar 11, 2021Updated 5 years ago
- nerf☆41Aug 1, 2022Updated 3 years ago
- pose estimation code with deepstream and yolo-pose☆13Oct 14, 2022Updated 3 years ago