StudyingLover/ggml-tutorial

☆34

Alternatives and similar repositories for ggml-tutorial

Users who are interested in ggml-tutorial are comparing it to the repositories listed below.

KuangjuX / cu-x
🎉 My Collections of CUDA Kernels~
☆11 · Jun 25, 2024 · Updated 2 years ago
EdVince / llm-cpp
☆34 · Jul 23, 2024 · Updated last year
Bruce-Lee-LY / cutlass_gemm
Multiple GEMM operators are constructed with cutlass to support LLM inference.
☆20 · Aug 3, 2025 · Updated 11 months ago
LeiWang1999 / TVM.CMakeExtend
Tutorials of Extending and importing TVM with CMAKE Include dependency.
☆16 · Oct 11, 2024 · Updated last year
lovemefan / ggml-learning-notes
ggml learning notes; ggml is an inference framework for machine learning
☆18 · Mar 24, 2024 · Updated 2 years ago
ZHEQIUSHUI / CLIP-ONNX-AX650-CPP
CLIP inference implemented in C++. The model is slightly modified, but not by much; the changes and the model export code can be found in the README, and the model files, including the AX650 models, are all in Releases. Newly added support for ChineseCLIP.
☆31 · Jun 19, 2025 · Updated last year
tlc-pack / cutlass_fpA_intB_gemm
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96 · Jun 21, 2026 · Updated last month
xiaqing10 / RKNN_YOLOV5S_CPP
A C++ implementation of YOLOv5 based on RKNN; it includes all dependency libraries and is a complete project that can be compiled and run directly
☆20 · Feb 10, 2022 · Updated 4 years ago
iimmortall / QuantLib
☆14 · Feb 3, 2022 · Updated 4 years ago
yvonwin / qwen2.cpp
qwen2 and llama3 cpp implementation
☆50 · Jun 7, 2024 · Updated 2 years ago
sophgo / libsophon
Sophgo AI chips driver and runtime library.
☆25 · Jun 30, 2026 · Updated 3 weeks ago
weishengying / cute_gemm
☆23 · Aug 14, 2024 · Updated last year
triple-mu / HunyuanDiT-TensorRT-libtorch
HunyuanDiT with TensorRT and libtorch
☆18 · May 22, 2024 · Updated 2 years ago
X-LANCE / UniCATS-CTX-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64 · Nov 18, 2024 · Updated last year
jiangnanboy / doc_ai
Converts OCR and other models from Paddle to ONNX format and uses DJL, a Java deep learning framework, to load these ONNX models for experimental inference.
☆14 · Nov 15, 2022 · Updated 3 years ago
XZLeo / Radar-Detection-with-Deep-Learning
☆29 · Aug 17, 2022 · Updated 3 years ago
MarchLiu / tensor-dancer
tensor library
☆17 · Jul 19, 2024 · Updated 2 years ago
ericperfect / libtorch_tokenizer
BERT Tokenizer in C++
☆79 · Jan 14, 2021 · Updated 5 years ago
aorogat / CBench
CBench, Benchmarking System for Question Answering Over Knowledge Graphs Systems.
☆12 · Sep 16, 2022 · Updated 3 years ago
baidubce / skills
skills published and maintained by baidu cloud engine
☆26 · Jun 14, 2026 · Updated last month
yhwang-hub / OrinMLLM
This project is primarily used to deploy large language models and multimodal large models on Orin.🚀🚀🚀
☆18 · Jun 23, 2026 · Updated last month
zzz3bbb3 / yolact-trt
segmentation algorithm yolact use tensorrt deploy
☆14 · May 7, 2022 · Updated 4 years ago
jundaf2 / CUDA-INT8-GEMM
CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API
☆37 · Sep 15, 2023 · Updated 2 years ago
SJTUSE-bot / SJTUSE-bot
☆10 · Aug 14, 2020 · Updated 5 years ago
lxl24 / SwinTransformerV2_TensorRT
For 2022 Nvidia Hackathon
☆22 · Jun 28, 2022 · Updated 4 years ago
Tartisan / MMDet3d-PointPillars
PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset
☆23 · Aug 11, 2022 · Updated 3 years ago
Oneflow-Inc / conda-env
☆12 · Mar 13, 2023 · Updated 3 years ago
myluluy / iWo
☆14 · Jun 1, 2015 · Updated 11 years ago
zejunwang1 / fastMatch
Large-scale exact string matching tool
☆17 · Mar 7, 2025 · Updated last year
yaoyi30 / ResNet_ncnn_android
This is an android app about monkey image classification
☆11 · Jun 16, 2022 · Updated 4 years ago
delta1037 / RknnInferTemplate
RKNN model inference and deployment template
☆24 · Aug 11, 2023 · Updated 2 years ago
li199603 / sgemm_with_cuda
SGEMM optimization with cuda step by step
☆23 · Mar 23, 2024 · Updated 2 years ago
PyThaiNLP / thai-g2p-wiktionary-corpus
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13 · Jul 25, 2022 · Updated 3 years ago
Syencil / Keypoints
Keypoints-detection in tensorflow and tensorRT C++
☆15 · Mar 4, 2020 · Updated 6 years ago
globaledgesoft / Unsupported-Operation-Development-in-SNPE
This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…
☆10 · Oct 4, 2021 · Updated 4 years ago
Kazuhito00 / MobileSAM-ONNX-Sample
A sample that converts the MobileSAM encoder/decoder to ONNX and runs inference
☆12 · Apr 11, 2024 · Updated 2 years ago
JuneTse / ReInceptionE
☆13 · Mar 16, 2021 · Updated 5 years ago
gty111 / GEMM_WMMA
GEMM by WMMA (tensor core)
☆15 · Jul 31, 2022 · Updated 3 years ago
MARD1NO / CUDA-PPT
☆136 · Apr 16, 2026 · Updated 3 months ago