Run ChatGLM2-6B on BM1684X
☆49Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for ChatGLM2-TPU
Users that are interested in ChatGLM2-TPU are comparing it to the libraries listed below.
- Sophgo AI chips driver and runtime library.☆24Apr 3, 2026Updated last week
- ☆44Jul 5, 2024Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆279Apr 1, 2026Updated 2 weeks ago
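- DETR tensor: removes the auxiliary heads that are unused during inference, adds FP16 deployment for a further speedup, and offers a new fix for all-zero outputs after conversion to TensorRT.☆11Jan 9, 2024Updated 2 years ago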
- Guide to deploying deep-learning inference networks and deep vision primitives on Sophon TPU.☆35May 25, 2023Updated 2 years ago
- ☆12Dec 16, 2021Updated 4 years ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆898Apr 1, 2026Updated 2 weeks ago
- Local knowledge-base question answering for Sophon BM1684X, based on LangChain and language models such as ChatGLM☆14Jun 5, 2024Updated last year
- simplify >2GB large onnx model☆71Nov 30, 2024Updated last year
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- PyTorch in Go, using LibTorch.☆15May 21, 2019Updated 6 years ago
- ☆24Aug 14, 2025Updated 8 months ago
- ☆30Jun 2, 2022Updated 3 years ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- OpenVINO™ optimization for PointPillars*☆32May 5, 2025Updated 11 months ago
- Accelerated deployment of BERT models with TensorRT☆10Apr 1, 2022Updated 4 years ago
- Python scripts performing open-vocabulary object detection using the YOLO-World model in ONNX, and exporting the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 8 months ago
- Examples for SophonSDK☆108Aug 11, 2022Updated 3 years ago
- TensorRT-FastSAM(https://github.com/CASIA-IVA-Lab/FastSAM)☆23Feb 29, 2024Updated 2 years ago
- Software and hardware H.264 decoding, based on FFmpeg and MPP☆11Mar 23, 2022Updated 4 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- SAM and LaMa inpainting with a Qt GUI: draw points or boxes interactively to run SAM with results displayed in real time, then inpaint; see the video in the README for details.☆53Jan 30, 2024Updated 2 years ago
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆23Aug 11, 2022Updated 3 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated last year
- ☆26Feb 2, 2024Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 3 months ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- ChatTTS is a generative speech model for daily dialogue.☆14Oct 21, 2024Updated last year
- Model Quantization Benchmark☆18Mar 23, 2026Updated 3 weeks ago
- A CUDA runtime environment based on the CUDA Driver API☆16Jul 30, 2025Updated 8 months ago
- export llama to onnx☆137Dec 28, 2024Updated last year
- Multiple Lidar preprocessor for BEVfusion☆11Aug 25, 2023Updated 2 years ago
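- Reproduces the point-cloud segmentation task with Point Transformers and applies the HAQ algorithm for automatic quantization and compression, with almost no loss of accuracy☆26Aug 25, 2022Updated 3 years ago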
- pose estimation code with deepstream and yolo-pose☆13Oct 14, 2022Updated 3 years ago
- ☆40Mar 25, 2026Updated 3 weeks ago
- The official content pack for MTS. Here for all to see and use as a guide to make their own.☆14Nov 3, 2025Updated 5 months ago
- stable diffusion using mnn☆67Sep 28, 2023Updated 2 years ago
- pip-manager is a command line tool to make Python packages management easy.☆11Jun 23, 2020Updated 5 years ago