run ChatGLM2-6B in BM1684X
☆49Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for ChatGLM2-TPU
Users that are interested in ChatGLM2-TPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sophgo AI chips driver and runtime library.☆23Mar 11, 2026Updated 2 weeks ago
- ☆44Jul 5, 2024Updated last year
- run chatglm3-6b in BM1684X☆39Mar 1, 2024Updated 2 years ago
- Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆☆22Oct 30, 2024Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆275Mar 18, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆10Jan 9, 2024Updated 2 years ago
- Guide to deploying deep-learning inference networks and deep vision primitives on Sophon TPU.☆36May 25, 2023Updated 2 years ago
- ☆12Dec 16, 2021Updated 4 years ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆877Feb 12, 2026Updated last month
- simplify >2GB large onnx model☆71Nov 30, 2024Updated last year
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- PyTorch in Go, using LibTorch.☆15May 21, 2019Updated 6 years ago
- ☆30Jun 2, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 使用onnxruntime部署夜间雾霾图像的可见度增强,包含C++和Python两个版本的程序☆13Feb 17, 2024Updated 2 years ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- ☆54Mar 27, 2023Updated 2 years ago
- OpenVINO™ optimization for PointPillars*☆32May 5, 2025Updated 10 months ago
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 3 years ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆11Aug 11, 2025Updated 7 months ago
- Examples for SophonSDK☆108Aug 11, 2022Updated 3 years ago
- cvitek ai compiler base on MLIR☆23Mar 14, 2022Updated 4 years ago
- TensorRT-FastSAM(https://github.com/CASIA-IVA-Lab/FastSAM)☆23Feb 29, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- h264的软解和硬解,基于FFmpeg和MPP☆11Mar 23, 2022Updated 4 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆52Jan 30, 2024Updated 2 years ago
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆22Aug 11, 2022Updated 3 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆17Apr 30, 2024Updated last year
- ☆25Feb 2, 2024Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆21Jan 2, 2026Updated 2 months ago
- 🎉My Collections of CUDA Kernels~☆10Jun 25, 2024Updated last year
- ChatTTS is a generative speech model for daily dialogue.☆14Oct 21, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Model Quantization Benchmark☆18Updated this week
- OpenWrt 22.03☆13May 9, 2025Updated 10 months ago
- 基于 CUDA Driver API 的 cuda 运行时环境☆15Jul 30, 2025Updated 7 months ago
- ☆36Mar 29, 2023Updated 2 years ago
- export llama to onnx☆136Dec 28, 2024Updated last year
- Multiple Lidar preprocessor for BEVfusion☆10Aug 25, 2023Updated 2 years ago
- 基于Point Transformers复现点云分割任务,并使用HAQ算法进行自动量化压缩,几乎不影响精度☆26Aug 25, 2022Updated 3 years ago