carsonpo/quadmul

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/carsonpo/quadmul)

carsonpo / quadmul

a fast and customizable CUDA int4 tensor core gemm

☆15

Alternatives and similar repositories for quadmul

Users that are interested in quadmul are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

carsonpo / safetensors.cpp
View on GitHub
Zero Dependency LibTorch Safetensors Loading and Storing in C++
☆23Jul 12, 2024Updated 2 years ago
carsonpo / megablock
View on GitHub
Mega🅱️lock
☆16Nov 1, 2024Updated last year
THUMNLab / AutoAttend
View on GitHub
Code Implementation for AutoAttend: Automated Attention Representation Search
☆11Jul 26, 2021Updated 4 years ago
PamanGie / yolov8_knowledge_distillation_with_custom_dataset
View on GitHub
YOLOv8 Knowledge Distillation
☆10Dec 28, 2024Updated last year
SamirMoustafa / nmt-with-attention-for-ar-to-en
View on GitHub
simple NMT With Attention For Arabic to English
☆11Mar 5, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wec7 / ML-algotrade
View on GitHub
Algorithmic Trading with Machine Learning
☆15Sep 26, 2015Updated 10 years ago
admineral / RAG-X
View on GitHub
Advanced Video Graph RAG using SAM2,CLIP,BLIP,Qwen2-VL,YOLO-World ,Neo4j, WebGPU, local LLM
☆14Nov 25, 2024Updated last year
zirui-ray-liu / Exact
View on GitHub
☆21Mar 23, 2022Updated 4 years ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
2U1 / DINOv2-Finetune
View on GitHub
An open-source implementaion for fine-tuning DINOv2 by Meta.
☆15Jul 21, 2025Updated last year
BUAADreamer / Qwen2-VL-History
View on GitHub
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
☆15Sep 17, 2024Updated last year
ayanban011 / GraphKD
View on GitHub
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16Sep 6, 2024Updated last year
facebookresearch / dualformer
View on GitHub
implementation of dualformer
☆25Mar 1, 2025Updated last year
janelu9 / EasyLLM
View on GitHub
Running Large Language Model easily.
☆14Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Zzzzz1 / CSKD
View on GitHub
Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…
☆15Nov 5, 2023Updated 2 years ago
HuangCongQing / model-compression-optimization
View on GitHub
model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏，量化，剪枝)
☆21Sep 10, 2024Updated last year
lnairGT / CLIP-Distillation
View on GitHub
Knowledge Distillation using Contrastive Language-Image Pretraining (CLIP) without a teacher model.
☆20Sep 6, 2024Updated last year
heiligerl / AutoPET_Challenge_Submission
View on GitHub
☆11Sep 4, 2022Updated 3 years ago
huyquoctrinh / KDAS
View on GitHub
This is an official implementation of KDAS for Knowledge Distillation Polyp Segmentation (ICME 2024)
☆17Oct 3, 2024Updated last year
malfet / llm_experiments
View on GitHub
☆13Jul 12, 2026Updated last week
sifakis / CS559F21_Demos
View on GitHub
☆16Nov 18, 2021Updated 4 years ago
shotit / shotit
View on GitHub
Shotit is a screenshot-to-video search engine tailored for TV & Film, blazing-fast and compute-efficient.
☆27Jun 29, 2026Updated 3 weeks ago
shankarp8 / knowledge_distillation
View on GitHub
Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).
☆27Aug 25, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GUOYUDOn / LLM_SelfStudy
View on GitHub
自学LLM的一些笔记与八股
☆26Apr 3, 2025Updated last year
princeton-nlp / PTP
View on GitHub
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073
☆32Jul 9, 2024Updated 2 years ago
YigePeng / AutoPET_False_Positive_Reduction
View on GitHub
repository for the MICCAI 2022 AutoPET challenge
☆14Sep 19, 2022Updated 3 years ago
hyhuang00 / moe_inference
View on GitHub
Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".
☆19Oct 30, 2024Updated last year
lixiuhong / implicit_gemm_convolution
View on GitHub
☆14May 28, 2019Updated 7 years ago
GeeeekExplorer / kkbot
View on GitHub
A Feishu/Lark AI agent bot
☆15Feb 27, 2026Updated 4 months ago
iamyb / mobileunet
View on GitHub
A lightweight UNet implementation, using Keras
☆14Jan 16, 2020Updated 6 years ago
swaggy-TN / EfficientVLM
View on GitHub
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)
☆33Jul 18, 2023Updated 3 years ago
bhimrazy / chat-with-phi-3-vision
View on GitHub
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…
☆34Jan 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nathanrs / go-micrograd
View on GitHub
Andrej Karpathy's micrograd library implemented in Go
☆18May 13, 2025Updated last year
opencomputeproject / FP8
View on GitHub
☆16Jun 23, 2023Updated 3 years ago
hxu296 / torch-evidental-deep-learning
View on GitHub
PyTorch implementation of the original evidental-deep-learning@https://github.com/aamini/evidential-deep-learning/
☆13Sep 20, 2021Updated 4 years ago
xjtulyc / MICCAI2022_paper_reading
View on GitHub
MACCIA 2022 paper reading notes: tasks and datasets
☆12Feb 6, 2023Updated 3 years ago
HaoKang-Timmy / torchanalyse
View on GitHub
A pytorch model profiler with information about macs, energy and e.t.c
☆17Feb 24, 2024Updated 2 years ago
xlite-dev / qwen-image-fast
View on GitHub
⚡️Qwen-Image 4.8x🎉 speedup with Hybrid Acceleration for low VRAM GPUs
☆17Oct 24, 2025Updated 8 months ago
dkobak / iclr-tsne
View on GitHub
Visualizing ICLR submissions
☆16Apr 27, 2023Updated 3 years ago