BestAnHongjun / LMDeploy-JetsonLinks

Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function independently without continuous internet access.

☆98

Alternatives and similar repositories for LMDeploy-Jetson

Users that are interested in LMDeploy-Jetson are comparing it to the libraries listed below

Sorting:

ShaohonChen / Qwen3-SmVL
将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调
☆195Updated last week
yinghuo302 / ascend-llm
基于昇腾310芯片的大语言模型部署
☆21Updated last year
Tlntin / qwen-ascend-llm
☆50Updated 9 months ago
wangzhaode / llm-export
llm-export can export llm model to onnx.
☆301Updated 6 months ago
sesmfs / onnx_quant_tool
An onnx-based quantitation tool.
☆71Updated last year
DataXujing / TensorRT-LLM-ChatGLM3
大模型部署实战：TensorRT-LLM, Triton Inference Server, vLLM
☆26Updated last year
luchangli03 / onnxsim_large_model
simplify >2GB large onnx model
☆61Updated 8 months ago
AI-Study-Han / Zero-Qwen-VL
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
☆64Updated 11 months ago
TRT2022 / trtllm-llama
☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化
☆50Updated last year
ICT-ANS / StarLight
☆61Updated last year
RethinkFun / trian_ppo
☆94Updated 10 months ago
DeepLink-org / dlinfer
☆52Updated this week
torchpipe / torchpipe
Serving Inside Pytorch
☆163Updated last week
ModelTC / LightCompress
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…
☆528Updated last week
thb1314 / mmyolo_tensorrt
☆147Updated last year
shouxieai / tensorRT_quantization
该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。
☆69Updated last year
D-Robotics-AI-Lab / DOSOD
A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space
☆88Updated 6 months ago
BestAnHongjun / InternDog
基于InternLM2大模型的离线具身智能导盲犬
☆102Updated last year
chenlamei / MobileVit_TensorRT
TensorRT 2022 亚军方案，tensorrt加速mobilevit模型
☆68Updated 3 years ago
sophgo / ChatGLM2-TPU
run ChatGLM2-6B in BM1684X
☆49Updated last year
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆36Updated 6 months ago
SmartFlowAI / LLM101n-CN
LLM101n: Let's build a Storyteller 中文版
☆132Updated 11 months ago
TencentARC / mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
☆91Updated 11 months ago
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆263Updated last week
Tlntin / trt2023
☆26Updated last year
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆42Updated last year
harleyszhang / lite_llama
A light llama-like llm inference framework based on the triton kernel.
☆144Updated last week
sesmfs / onnx_matcher
Using pattern matcher in onnx model to match and replace subgraphs.
☆81Updated last year
AXERA-TECH / ax-llm
Explore LLM model deployment based on AXera's AI chips
☆109Updated 3 weeks ago
AXERA-TECH / OWLVIT-ONNX-AX650-CPP
☆22Updated last year