Yangxiaoz/GGML-Tutorial

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yangxiaoz/GGML-Tutorial)

Yangxiaoz / GGML-Tutorial

To better understand the ggml library

☆30

Alternatives and similar repositories for GGML-Tutorial

Users that are interested in GGML-Tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mz24cn / gemm_optimization
View on GitHub
The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Inte…
☆17Mar 28, 2019Updated 7 years ago
xiaodong-lx / tplink-ipc-control
View on GitHub
TPLink IPC Control
☆20Jul 24, 2024Updated last year
KAIWEILIUCC / Awesome-LLM-IoT-Papers
View on GitHub
A collection of papers on LLM applications in the IoT field.
☆22Jan 21, 2026Updated 6 months ago
KarhouTam / cuda-kernels
View on GitHub
Some common CUDA kernel implementations (Not the fastest).
☆30Jun 24, 2026Updated 3 weeks ago
QimingZheng / gemmlab
View on GitHub
☆25Mar 31, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ASU-ESIC-FAN-Lab / RepNet
View on GitHub
☆13Jul 3, 2025Updated last year
UbiquitousLearning / MobileFM
View on GitHub
One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…
☆30Mar 5, 2024Updated 2 years ago
yvonwin / qwen2.cpp
View on GitHub
qwen2 and llama3 cpp implementation
☆50Jun 7, 2024Updated 2 years ago
lmyybh / cuda-kernel
View on GitHub
☆24Apr 2, 2026Updated 3 months ago
adnansirajrakin / TBT-CVPR2020
View on GitHub
In the repository we provide a sample code to implement the Targeted Bit Trojan attack.
☆20Nov 7, 2020Updated 5 years ago
Ying1123 / VTC-artifact
View on GitHub
☆48Jun 7, 2024Updated 2 years ago
CentOS / centos-cloud
View on GitHub
☆11May 5, 2020Updated 6 years ago
AamirRaihan / SWAT
View on GitHub
Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.
☆29Jul 23, 2021Updated 4 years ago
tong2prosperity / milligrad-rs
View on GitHub
auto grad in rust with video explanation.
☆26Jun 19, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KULeuven-MICAS / snax-gemm
View on GitHub
☆17Jul 1, 2024Updated 2 years ago
enyac-group / Elana
View on GitHub
Elana: A Simple Energy & Latency Analyzer for LLMs
☆16Apr 3, 2026Updated 3 months ago
masmullin2000 / qemu-runner
View on GitHub
a script to help with running qemu
☆12May 19, 2024Updated 2 years ago
d-matrix-ai / keyformer-llm
View on GitHub
Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning
☆57Mar 26, 2024Updated 2 years ago
xmouyang / ClusterFL
View on GitHub
Repo for MobiSys 2021 paper: "ClusterFL: A Similarity-Aware Federated Learning System for Human Activity Recognition".
☆39Apr 4, 2023Updated 3 years ago
harnets / multiverse
View on GitHub
GPU-accelerated LLM Training Simulator
☆22Jun 26, 2025Updated last year
Allen-C-Guan / Pytorch-Inductor-Tutorial
View on GitHub
☆97Jun 26, 2026Updated 3 weeks ago
fzinfz / book
View on GitHub
Tech notes for mkdocs and gitbook
☆18Updated this week
VITA-Group / GraNet
View on GitHub
[Neurips 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration
☆31Feb 11, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mk-fg / cgroup-tools
View on GitHub
A set of tools to work with cgroup tree and process classification/QoS according to it
☆10Oct 1, 2019Updated 6 years ago
yzhaiustc / Optimizing-SGEMV-on-NVIDIA-GPUs
View on GitHub
An implementation of SGEMV with performance comparable to cuBLAS.
☆12May 21, 2021Updated 5 years ago
leo-project / leo_erasure
View on GitHub
Erasure code library for Erlang
☆12Sep 5, 2024Updated last year
chyyuu / rt-patch-analysis
View on GitHub
☆11Sep 15, 2017Updated 8 years ago
FellouAI / fellou-blog
View on GitHub
Fellou news - fellou.ai/blog
☆23Oct 24, 2025Updated 8 months ago
iiicp / study-llvm-from-scratch
View on GitHub
llvm slides and books and other
☆61Feb 2, 2025Updated last year
Enter-tainer / simplerv
View on GitHub
☆15Oct 23, 2023Updated 2 years ago
ethz-spylab / misleading-privacy-evals
View on GitHub
Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)
☆13Apr 29, 2024Updated 2 years ago
lewei50 / LeweiTcpClient
View on GitHub
An arduino example（TCP） with Lewei50 (IOT) platform for free use .
☆12Mar 25, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
66RING / tiny-flash-attention
View on GitHub
flash attention tutorial written in python, triton, cuda, cutlass
☆527Jan 20, 2026Updated 6 months ago
xxxxyu / FlexNN
View on GitHub
[MobiCom 24] Adaptive DNN inference under memory constraints
☆57Jan 22, 2025Updated last year
jinb-park / kfuzz
View on GitHub
A bunch of sample codes related to kernel fuzzing
☆12Feb 7, 2019Updated 7 years ago
yuanmu97 / PacketGame
View on GitHub
[SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
☆15Jul 1, 2023Updated 3 years ago
fhboswell / llvm-analysis-and-transform-passes
View on GitHub
LLVM passes with usage instructions
☆18Apr 23, 2017Updated 9 years ago
dongzhiyan-stack / async_memory_reclaim_for_cold_file_area
View on GitHub
linux内核异步内存回收的另一个思路：基于冷热文件的冷热区域精准的回收冷文件页page(可做成内核ko)
☆13Jun 14, 2024Updated 2 years ago
dpc-grindland / Cudagrind
View on GitHub
A Valgrind extension for CUDA, unofficial mirror for https://www.hlrs.de/organization/av/spmt/research/cudagrind/
☆10Aug 5, 2015Updated 10 years ago