ggml learning notes. ggml is an inference framework for machine learning.
☆18Mar 24, 2024Updated last year
Alternatives and similar repositories for ggml-learning-notes
Users that are interested in ggml-learning-notes are comparing it to the libraries listed below
- Inference deployment of Llama 3☆11Apr 21, 2024Updated last year
- ☆10Jul 18, 2024Updated last year
- RKNN model inference deployment template☆24Aug 11, 2023Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Comparison of large-model API performance metrics - in-depth analysis of key metrics such as TTFT and TPS☆20Sep 12, 2024Updated last year
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- Header-only safetensors loader and saver in C++☆78Dec 27, 2025Updated 2 months ago
- ☆17Jan 1, 2024Updated 2 years ago
- Deployment of the RKNN inference framework on Rockchip chips (YOLO models)☆14Jul 17, 2025Updated 7 months ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 3 months ago
- A layered, decoupled deep learning inference engine☆79Feb 17, 2025Updated last year
- Simple C++ FFmpeg video encoder. Raw data to mp4 (h264) file.☆21Jan 24, 2021Updated 5 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- LLM deployment project based on ONNX.☆50Oct 9, 2024Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- ☆20Dec 29, 2023Updated 2 years ago
- RKNN-YOLOV5-BatchInference-MultiThreading: multi-threaded C++ inference with YOLOV5 on multiple images☆22Nov 6, 2023Updated 2 years ago
- A one-page-only CGraph-API-like DAG project.☆26Feb 11, 2025Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆36Jul 14, 2025Updated 7 months ago
- Example of SenseCraft Model Assistant Model deployment related to ESP32☆32Apr 9, 2025Updated 10 months ago
- ☆28Jun 30, 2025Updated 8 months ago
- YOLOv8 rotated object detection deployment: Rockchip RKNN chips, Horizon chips, and TensorRT☆28Jun 4, 2024Updated last year
- ☆39Feb 12, 2026Updated 2 weeks ago
- [CVPR 2023] OC-SORT implemented in C++ with the Eigen library, plus an Android demo APK☆68Dec 24, 2025Updated 2 months ago
- ☆25Aug 27, 2021Updated 4 years ago
- Python scripts for performing monocular depth estimation using the SC_Depth model in ONNX☆32Nov 13, 2022Updated 3 years ago
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆49Feb 23, 2026Updated last week
- An ONNX-based quantization tool.☆71Jan 8, 2024Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆40Jun 19, 2024Updated last year
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆48Nov 10, 2025Updated 3 months ago
- ☆34Sep 8, 2024Updated last year
- ☆18Jan 12, 2026Updated last month
- ☆20Updated this week
- This project is intended to build and deploy an SNPE model on Qualcomm devices that have unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- This project is about convolution operator optimization on GPU, including GEMM-based (implicit GEMM) convolution.☆43Sep 29, 2025Updated 5 months ago
- Parallel computing learning notes☆44Feb 25, 2017Updated 9 years ago
- [ICLR2024]: LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection.☆81Sep 20, 2024Updated last year
- Model compression for ONNX☆100Updated this week
- ☆10Apr 13, 2022Updated 3 years ago