run ChatGLM2-6B in BM1684X
☆49Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for ChatGLM2-TPU
Users that are interested in ChatGLM2-TPU are comparing it to the libraries listed below.
- A whisper repo for TPU☆11Jun 4, 2024Updated last year
- Sophgo AI chips driver and runtime library.☆24Apr 20, 2026Updated 2 weeks ago
- ☆44Jul 5, 2024Updated last year
- run chatglm3-6b in BM1684X☆39Mar 1, 2024Updated 2 years ago
- Text2speech & tone color conversion demo running on SG2300x: TTS plus instant voice cloning, combining openvoice and emotivoice☆22Oct 30, 2024Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆282Apr 16, 2026Updated 2 weeks ago
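- DETR tensor: removes the auxiliary heads that are unused during inference, speeds up deployment further with fp16, and offers a new method for fixing all-zero outputs after conversion to tensorrt.☆11Jan 9, 2024Updated 2 years ago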
- Guide to deploying deep-learning inference networks and deep vision primitives on Sophon TPU.☆35May 25, 2023Updated 2 years ago
- ☆12Dec 16, 2021Updated 4 years ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆910Updated this week
- simplify >2GB large onnx model☆71Nov 30, 2024Updated last year
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- PyTorch in Go, using LibTorch.☆15May 21, 2019Updated 6 years ago
- ☆30Jun 2, 2022Updated 3 years ago
- Visibility enhancement for nighttime hazy images, deployed with onnxruntime; includes both C++ and Python versions of the program☆13Feb 17, 2024Updated 2 years ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- ☆54Mar 27, 2023Updated 3 years ago
- OpenVINO™ optimization for PointPillars*☆32May 5, 2025Updated 11 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 8 months ago
- Examples for SophonSDK☆107Aug 11, 2022Updated 3 years ago
- CASTER: Predicting Drug Interactions with Chemical Substructure Representation (AAAI 2020)☆25Oct 28, 2020Updated 5 years ago
- cvitek ai compiler based on MLIR☆23Mar 14, 2022Updated 4 years ago
- TensorRT-FastSAM(https://github.com/CASIA-IVA-Lab/FastSAM)☆23Feb 29, 2024Updated 2 years ago
- Software and hardware decoding of h264, based on FFmpeg and MPP☆11Mar 23, 2022Updated 4 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
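- SAM and lama inpaint, with a QT GUI: draw points or boxes interactively to run SAM with results displayed in real time, then run Inpaint. See the video in the readme for the detailed steps.☆54Jan 30, 2024Updated 2 years ago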
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆23Aug 11, 2022Updated 3 years ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Sep 12, 2022Updated 3 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated 2 years ago
- ☆26Feb 2, 2024Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 4 months ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- ☆37Feb 11, 2026Updated 2 months ago
- OpenWrt 22.03☆13May 9, 2025Updated 11 months ago
- A cuda runtime environment based on the CUDA Driver API☆16Jul 30, 2025Updated 9 months ago
- ☆36Mar 29, 2023Updated 3 years ago
- export llama to onnx☆138Dec 28, 2024Updated last year
- Multiple Lidar preprocessor for BEVfusion☆11Aug 25, 2023Updated 2 years ago