NVIDIA / TensorRT-Edge-LLMLinks
High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI
☆213Updated 3 weeks ago
Alternatives and similar repositories for TensorRT-Edge-LLM
Users that are interested in TensorRT-Edge-LLM are comparing it to the libraries listed below
Sorting:
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆224Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆78Updated 8 months ago
- A simple tool that can generate TensorRT plugin code quickly.☆238Updated 2 years ago
- YOLOv5 on Orin DLA☆221Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆363Updated 3 years ago
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆22Updated 2 weeks ago
- Collection of blogs on AI development☆21Updated last year
- TensorRT Plugin Autogen Tool☆367Updated 2 years ago
- Offline Quantization Tools for Deploy.☆141Updated 2 years ago
- This is a repository to practice multi-thread programming in C++☆27Updated last year
- A tutorial for CUDA&PyTorch☆208Updated last week
- BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).☆551Updated 2 years ago
- ☆311Updated 3 years ago
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆170Updated 3 years ago
- TensorRT 7 C++ (almost) minimal examples☆84Updated 2 years ago
- Deep Learning tools and applications for NVIDIA AGX platforms.☆265Updated this week
- ☆38Updated last year
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆71Updated 2 years ago
- ☆45Updated 3 years ago
- A fork of the BEVDet series .☆21Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆476Updated 2 months ago
- Llama3 Streaming Chat Sample☆22Updated last year
- High Performance LLM Inference Operator Library☆603Updated this week
- A set of examples around MegEngine☆31Updated 2 years ago
- Serving Inside Pytorch☆170Updated last week
- Python C++ Code Manager☆15Updated last year
- This repository describes how to add a custom TensorRT plugin in c++ and python☆29Updated 4 years ago
- ☆60Updated last year
- ☆26Updated 2 years ago
- ☆152Updated last year