Fridge003/Cuda-Learn-By-Practice

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Fridge003/Cuda-Learn-By-Practice)

Fridge003 / Cuda-Learn-By-Practice

Codebase for Cuda Learning

☆36

Alternatives and similar repositories for Cuda-Learn-By-Practice

Users that are interested in Cuda-Learn-By-Practice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nqchieutb01 / vietnamese-sentence-paraphase
View on GitHub
paraphase sentence
☆11Aug 22, 2025Updated 11 months ago
joelulu / Awesome-Acceleration-GenAI
View on GitHub
Collection of Acceleration Methods for Generative AI
☆29Dec 9, 2025Updated 7 months ago
mettamind-ai / physics_of_llms
View on GitHub
Các thí nghiệm liên quan tới LLMs cho tiếng Việt (insprised by Physics of LLMs Series)
☆11Oct 21, 2024Updated last year
flash-bon / flash-bon
View on GitHub
(ECCV 2026): Official code for Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models
☆18Jul 9, 2026Updated 2 weeks ago
taosdata / vscode-tdengine
View on GitHub
visual studio code extension for TDengine
☆10Mar 21, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lemyx / tilelang-dsa
View on GitHub
DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang
☆47Nov 19, 2025Updated 8 months ago
sgl-project / DeepGEMM
View on GitHub
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
☆32Updated this week
Dao-AILab / AI-workflow
View on GitHub
☆71Mar 24, 2026Updated 4 months ago
Andy-xiaokang / AP1400-2
View on GitHub
AP1400-2
☆10Aug 5, 2024Updated last year
ishandhanani / srt-slurm
View on GitHub
Benchmark SGLang on SLURM
☆24Apr 20, 2026Updated 3 months ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
thunlp / CSS-LM
View on GitHub
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
☆11Jul 1, 2023Updated 3 years ago
hao-ai-lab / flash-attention-fp4
View on GitHub
NVFP4 Flash-Attention 4 on BlackWell
☆30Updated this week
ozyyshr / ShareGPT_investigation
View on GitHub
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))
☆13Dec 21, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fzyzcjy / torch_utils
View on GitHub
Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocatio…
☆114Sep 11, 2025Updated 10 months ago
Arlenelalala / ArxivPaper
View on GitHub
定时爬取arXiv每日论文
☆13May 22, 2023Updated 3 years ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆29Sep 4, 2025Updated 10 months ago
AveryQi115 / 6.824
View on GitHub
☆11Apr 16, 2022Updated 4 years ago
PipeFusion / PipeFusion
View on GitHub
A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters
☆58May 3, 2026Updated 2 months ago
fastai / docments
View on GitHub
Document parameters using comments
☆10Aug 6, 2021Updated 4 years ago
jasperzhong / read-papers-and-code
View on GitHub
My paper/code reading notes in Chinese
☆46Mar 26, 2026Updated 4 months ago
BBuf / AI-Infra-Auto-Driven-SKILLS
View on GitHub
☆696Jul 14, 2026Updated last week
ucd-plse / PyDFix
View on GitHub
PyDFix is a tool that helps detect and fix dependency errors that cause the unreproducibility of Python builds. PyDFix takes as input the…
☆12Feb 7, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ruipeterpan / failfast
View on GitHub
Artifact for "Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs" [arXiv '25]
☆20May 4, 2026Updated 2 months ago
lnis-uofu / ML-Mapper
View on GitHub
☆14Dec 31, 2022Updated 3 years ago
maropu / spark-data-repair-plugin
View on GitHub
Provide functionality to build statistical models to repair dirty tabular data in Spark
☆12Apr 21, 2023Updated 3 years ago
half-dreamer / CS61C-20su
View on GitHub
☆20Dec 24, 2023Updated 2 years ago
patrick-toulme / pyptx
View on GitHub
A Python DSL to write Nvidia PTX for Hopper and Blackwell in JAX and PyTorch
☆367Jul 9, 2026Updated 2 weeks ago
rishisankar / leetgpu
View on GitHub
Solutions to leetgpu CUDA challenges on https://leetgpu.com/
☆19May 25, 2025Updated last year
banburytang / List-of-Chinese-Open-Source-Project-Financing
View on GitHub
☆16Nov 2, 2022Updated 3 years ago
ROCm / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆43Updated this week
Jaykef / Triton-nanoGPT
View on GitHub
Custom triton kernels for training Karpathy's nanoGPT.
☆19Oct 21, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gogongxt / nano-sglang
View on GitHub
☆160Mar 5, 2026Updated 4 months ago
li-plus / flash-preference
View on GitHub
Accelerate LLM preference tuning via prefix sharing with a single line of code
☆52Jul 4, 2025Updated last year
huggingface / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆19Jul 27, 2023Updated 2 years ago
howardlau1999 / autograd
View on GitHub
A simple demonstration of how PyTorch autograd works
☆16Sep 23, 2021Updated 4 years ago
chenyueqi / hotBPF
View on GitHub
☆15Apr 28, 2023Updated 3 years ago
guopp / awesome-dev-blog-article
View on GitHub
学习与开发过程中，发现的比较好的一些博客和文章之类的内容，收集着，利人利己，持续更新。
☆18Mar 8, 2016Updated 10 years ago
ljl0222 / cpu89
View on GitHub
简单改造了自己动手实现CPU的代码
☆10Apr 23, 2021Updated 5 years ago