luchangli03/export_llama_to_onnx

luchangli03 / export_llama_to_onnx

export llama to onnx

☆138

Alternatives and similar repositories for export_llama_to_onnx

Users who are interested in export_llama_to_onnx are comparing it to the libraries listed below.

luchangli03 / onnxsim_large_model
simplify >2GB large onnx model
☆72 · Nov 30, 2024 · Updated last year
wangzhaode / llm-export
llm-export can export llm model to onnx.
☆353 · May 8, 2026 · Updated 2 months ago
tpoisonooo / llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
☆368 · Jul 6, 2023 · Updated 3 years ago
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆35 · Nov 25, 2025 · Updated 7 months ago
sophgo / ChatGLM2-TPU
run ChatGLM2-6B in BM1684X
☆49 · Mar 1, 2024 · Updated 2 years ago
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023 second-round project: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM
☆43 · Oct 20, 2023 · Updated 2 years ago
LCH1238 / BEVDet
A fork of the BEVDet series.
☆22 · Oct 8, 2023 · Updated 2 years ago
wejoncy / QLLM
A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.
☆190 · Mar 23, 2026 · Updated 3 months ago
ytliu74 / RISCV_Verilog
RISC-V SOC (both single and pipeline) implemented in Verilog. Passed all test codes provided by TA.
☆21 · Jun 3, 2023 · Updated 3 years ago
TRT2022 / trtllm-llama
☢️ TensorRT 2023 second round: inference acceleration and optimization of the Llama model based on TensorRT-LLM
☆54 · Oct 20, 2023 · Updated 2 years ago
Terapines / AI-Benchmark
RISCV C and Triton AI-Benchmark
☆26 · Jan 28, 2026 · Updated 5 months ago
ros-perception / point_cloud_transport_tutorial
This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…
☆11 · Mar 17, 2026 · Updated 4 months ago
cqu20160901 / DETR_onnx_tensorRT_V2
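DETR tensor: removes the auxiliary heads that are unused during inference + fp16 deployment for a further speedup + a new method for fixing all-zero outputs after converting to TensorRT.
☆12 · Jan 9, 2024 · Updated 2 years ago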
eugene87222 / NYCU_2021spring_Deep_Learning_and_Practice
☆10 · Jul 13, 2021 · Updated 5 years ago
Dominic23331 / sam_onnx
☆25 · Apr 22, 2023 · Updated 3 years ago
cqu20160901 / centernet3d_onnx_rknn_horizon_tensorRT
CenterNet3D deployment version, easy to port to different platforms (onnx, tensorRT, rknn, Horizon).
☆14 · May 24, 2024 · Updated 2 years ago
OpenPPL / ppl.nn.llm
☆140 · Apr 23, 2024 · Updated 2 years ago
Infrasys-AI / aiinfra-docs
☆21 · Nov 6, 2025 · Updated 8 months ago
lrw04 / llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
☆79 · Dec 17, 2023 · Updated 2 years ago
hplp / PiMulator
Processing in Memory Emulation
☆28 · Feb 24, 2023 · Updated 3 years ago
ZHEQIUSHUI / CLIP-ONNX-AX650-CPP
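CLIP inference implemented in C++. The model has a few minor changes; the changes and the model export code can be found in the README, and the model files are all in Releases, including the AX650 models. Support for ChineseCLIP has been added.
☆31 · Jun 19, 2025 · Updated last year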
OpenPPL / ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
☆1,806 · Mar 28, 2024 · Updated 2 years ago
Stephenfang51 / YOLOP-TensorRT
unofficial implementation of YOLOP TensorRT
☆12 · Dec 11, 2021 · Updated 4 years ago
Phoenix8215 / learn-TensorRT-from-scratch
learn TensorRT from scratch🥰
☆18 · Sep 29, 2024 · Updated last year
merrymercy / Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
☆11 · Mar 25, 2024 · Updated 2 years ago
OpenPPL / ppl.nn
A primitive library for neural network
☆1,367 · Nov 24, 2024 · Updated last year
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆96 · Oct 26, 2024 · Updated last year
daquexian / faster-rwkv
☆126 · Dec 15, 2023 · Updated 2 years ago
pengzhendong / torchfa
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆61 · Sep 5, 2025 · Updated 10 months ago
ModelTC / LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.
☆735 · May 14, 2026 · Updated 2 months ago
fabio-sim / DocShadow-ONNX-TensorRT
ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀
☆25 · Sep 13, 2023 · Updated 2 years ago
wangzhaode / mnn-segment-anything
segment-anything based mnn
☆37 · Dec 13, 2023 · Updated 2 years ago
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆166 · Jul 2, 2026 · Updated 2 weeks ago
GuillaumeVW / NSNet
This is an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single channel speech enhanc…
☆40 · Aug 20, 2020 · Updated 5 years ago
EdVince / model_zoo
Recording models
☆12 · Sep 19, 2023 · Updated 2 years ago
thfylsty / Yolov7-25d
Based on yolov7, with depth regression added
☆20 · Nov 4, 2022 · Updated 3 years ago
weishengying / cute_gemm
☆23 · Aug 14, 2024 · Updated last year
penhunt / full-quantization-DNN
PyTorch code for full quantization of DNN using BCGD
☆14 · Jul 24, 2019 · Updated 6 years ago
wangzhaode / mnn-stable-diffusion
stable diffusion using mnn
☆68 · Sep 28, 2023 · Updated 2 years ago