sophgo/ChatGLM2-TPU

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sophgo/ChatGLM2-TPU)

sophgo / ChatGLM2-TPU

run ChatGLM2-6B in BM1684X

☆49

Alternatives and similar repositories for ChatGLM2-TPU

Users that are interested in ChatGLM2-TPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JKay0327 / whisper-TPU_py
View on GitHub
A whisper repo for TPU
☆11Jun 4, 2024Updated 2 years ago
sophgo / libsophon
View on GitHub
Sophgo AI chips driver and runtime library.
☆26Updated this week
sophgo / ChatGLM3-TPU
View on GitHub
run chatglm3-6b in BM1684X
☆38Mar 1, 2024Updated 2 years ago
ZillaRU / EmotiVoice-TPU
View on GitHub
Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆
☆22Oct 30, 2024Updated last year
sophgo / LLM-TPU
View on GitHub
Run generative AI models in sophgo BM1684X/BM1688
☆293Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
cqu20160901 / DETR_onnx_tensorRT_V2
View on GitHub
DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。
☆12Jan 9, 2024Updated 2 years ago
pointpillars-on-openvino / pointpillars-on-openvino
View on GitHub
☆12Dec 16, 2021Updated 4 years ago
sophgo / tpu-mlir
View on GitHub
Machine learning compiler based on MLIR for Sophgo TPU.
☆952Updated this week
luchangli03 / onnxsim_large_model
View on GitHub
simplify >2GB large onnx model
☆72Nov 30, 2024Updated last year
lxl24 / SwinTransformerV2_TensorRT
View on GitHub
For 2022 Nvidia Hackathon
☆22Jun 28, 2022Updated 4 years ago
hpc203 / nighttime_dehaze-onnxrun
View on GitHub
使用onnxruntime部署夜间雾霾图像的可见度增强，包含C++和Python两个版本的程序
☆13Feb 17, 2024Updated 2 years ago
jat001 / gotorch
View on GitHub
PyTorch in Go, using LibTorch.
☆15May 21, 2019Updated 7 years ago
LittleRain626 / rknn_yolov5_3588_bytetrack
View on GitHub
☆31Jun 2, 2022Updated 4 years ago
keddyjin / TensorRT_StableDiffusion_ControlNet
View on GitHub
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…
☆26Jul 21, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
intel / OpenVINO-optimization-for-PointPillars
View on GitHub
OpenVINO™ optimization for PointPillars*
☆32May 5, 2025Updated last year
sophgo / tpu_compiler
View on GitHub
cvitek ai compiler base on MLIR
☆23Mar 14, 2022Updated 4 years ago
AXERA-TECH / ONNX-YOLO-World-Open-Vocabulary-Object-Detection
View on GitHub
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU
☆12Aug 11, 2025Updated 11 months ago
DataXujing / Bert_TensorRT
View on GitHub
Bert TensorRT模型加速部署
☆10Apr 1, 2022Updated 4 years ago
thb1314 / tensorrt-onnx-fasterrcnn-fpn-roialign
View on GitHub
☆53Mar 27, 2023Updated 3 years ago
sophon-ai-algo / examples
View on GitHub
Examples for SophonSDK
☆107Aug 11, 2022Updated 3 years ago
kexinhuang12345 / CASTER
View on GitHub
CASTER: Predicting Drug Interactions with Chemical Substructure Representation (AAAI 2020)
☆25Oct 28, 2020Updated 5 years ago
richjjj / cuvid-tensorrt-multi
View on GitHub
ffmpeg+cuvid+tensorrt+multicamera
☆12Dec 31, 2024Updated last year
Done4 / FFmpegMPPDecoder
View on GitHub
h264的软解和硬解，基于FFmpeg和MPP
☆11Mar 23, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Linaom1214 / TensorRT-FastSAM
View on GitHub
TensorRT-FastSAM(https://github.com/CASIA-IVA-Lab/FastSAM)
☆23Feb 29, 2024Updated 2 years ago
ZHEQIUSHUI / SAM-ONNX-AX650-CPP
View on GitHub
SAM and lama inpaint，包含QT的GUI交互界面，实现了交互式可实时显示结果的画点、画框进行SAM，然后通过进行Inpaint，具体操作看readme里的视频。
☆54Jan 30, 2024Updated 2 years ago
newintelligence4 / BEVfusion_preprocess
View on GitHub
Multiple Lidar preprocessor for BEVfusion
☆11Aug 25, 2023Updated 2 years ago
Tartisan / MMDet3d-PointPillars
View on GitHub
PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset
☆23Aug 11, 2022Updated 3 years ago
drilistbox / FlashOCC_on_UniOcc_and_RenderOCC
View on GitHub
☆26Feb 2, 2024Updated 2 years ago
KuangjuX / cu-x
View on GitHub
🎉My Collections of CUDA Kernels~
☆11Jun 25, 2024Updated 2 years ago
ZillaRU / ChatTTS-ONNX
View on GitHub
ChatTTS is a generative speech model for daily dialogue.
☆14Oct 21, 2024Updated last year
bytedance / MRECG
View on GitHub
☆36Mar 29, 2023Updated 3 years ago
YdrMaster / cuda-driver
View on GitHub
基于 CUDA Driver API 的 cuda 运行时环境
☆16Jul 30, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Gwencong / deepstream-pose
View on GitHub
pose estimation code with deepstream and yolo-pose
☆13Oct 14, 2022Updated 3 years ago
sophgo / tpu-mq
View on GitHub
Model Quantization Benchmark
☆20Apr 17, 2026Updated 3 months ago
wangzhaode / mnn-stable-diffusion
View on GitHub
stable diffusion using mnn
☆68Sep 28, 2023Updated 2 years ago
GesilaA / deepsort_tensorrt
View on GitHub
This is a TensorRT based deepsort project
☆79Sep 24, 2021Updated 4 years ago
OpenPPL / ppl.pmx
View on GitHub
☆61Nov 21, 2024Updated last year
sophgo / tdl_sdk
View on GitHub
☆47Jun 30, 2026Updated 3 weeks ago
deep-practice / FastBEV-TensorRT
View on GitHub
☆15Apr 18, 2023Updated 3 years ago