☆34Sep 8, 2024Updated last year
Alternatives and similar repositories for ggml-tutorial
Users that are interested in ggml-tutorial are comparing it to the libraries listed below
Sorting:
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- ☆13Jul 2, 2025Updated 8 months ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 8 months ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated last year
- ☆12Dec 16, 2021Updated 4 years ago
- ☆33Jul 23, 2024Updated last year
- ☆14Feb 3, 2022Updated 4 years ago
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆16Oct 11, 2024Updated last year
- 基于rknn的yolov5的cpp实现,包含各种依赖库,是一个完整工程,可直接编译运行☆21Feb 10, 2022Updated 4 years ago
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated 9 months ago
- Keypoints-detection in tensorflow and tensorRT C++☆15Mar 4, 2020Updated 5 years ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 6 months ago
- segmentation algorithm yolact use tensorrt deploy☆14May 7, 2022Updated 3 years ago
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- qwen2 and llama3 cpp implementation☆49Jun 7, 2024Updated last year
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆22Aug 11, 2022Updated 3 years ago
- ☆23Dec 8, 2022Updated 3 years ago
- SGEMM optimization with cuda step by step☆21Mar 23, 2024Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Feb 20, 2026Updated last week
- Sophgo AI chips driver and runtime library.☆24Feb 5, 2026Updated 3 weeks ago
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆36Jul 14, 2025Updated 7 months ago
- A plugin to make view transformer from perspective view to bird-eye-view, it is used in bevdet☆25Feb 24, 2023Updated 3 years ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆35Sep 15, 2023Updated 2 years ago
- Flash Attention in raw Cuda C beating PyTorch☆37May 14, 2024Updated last year
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆49Feb 23, 2026Updated last week
- ☆30May 1, 2022Updated 3 years ago
- OpenVINO™ optimization for PointPillars*☆31May 5, 2025Updated 9 months ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆81May 26, 2025Updated 9 months ago
- LightNet is an optimized deep learning framework based on the popular darknet platform. It is optimized to create efficient and high-spee…☆38Sep 17, 2023Updated 2 years ago
- segment-anything based mnn☆36Dec 13, 2023Updated 2 years ago
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 2 years ago
- 分层解耦的深度学习推理引擎☆79Feb 17, 2025Updated last year
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- ☆20Oct 14, 2025Updated 4 months ago
- BERT Tokenizer in C++☆79Jan 14, 2021Updated 5 years ago
- ppstructure deploy by ncnn☆35Jul 16, 2024Updated last year