run ChatGLM2-6B in BM1684X
☆49Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for ChatGLM2-TPU
Users that are interested in ChatGLM2-TPU are comparing it to the libraries listed below
Sorting:
- A whisper repo for TPU☆11Jun 4, 2024Updated last year
- ☆12Dec 16, 2021Updated 4 years ago
- Sophgo AI chips driver and runtime library.☆24Feb 5, 2026Updated last month
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆12Jan 9, 2024Updated 2 years ago
- ☆44Jul 5, 2024Updated last year
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- Guide to deploying deep-learning inference networks and deep vision primitives on Sophon TPU.☆36May 25, 2023Updated 2 years ago
- Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆☆22Oct 30, 2024Updated last year
- 使用onnxruntime部署夜间雾霾图像的可见度增强,包含C++和Python两个版本的程序☆13Feb 17, 2024Updated 2 years ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Jun 5, 2024Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆270Feb 26, 2026Updated last week
- run chatglm3-6b in BM1684X☆39Mar 1, 2024Updated 2 years ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆873Feb 12, 2026Updated 3 weeks ago
- OpenVINO™ optimization for PointPillars*☆31May 5, 2025Updated 10 months ago
- ☆30Jun 2, 2022Updated 3 years ago
- ☆54Mar 27, 2023Updated 2 years ago
- ☆36Mar 29, 2023Updated 2 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆22Aug 11, 2022Updated 3 years ago
- simplify >2GB large onnx model☆71Nov 30, 2024Updated last year
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- 🎉My Collections of CUDA Kernels~