🎉CUDA notes / collection of frequently asked interview questions / C++ notes; personal notes, updated irregularly: sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
☆50Feb 23, 2024Updated 2 years ago
Alternatives and similar repositories for CUDA-Learn-Note
Users that are interested in CUDA-Learn-Note are comparing it to the libraries listed below.
- ☆22Aug 14, 2024Updated last year
- CUDA Based De-dispersion library☆12Jun 8, 2024Updated 2 years ago
- ☆11Nov 8, 2017Updated 8 years ago
- 🎉CUDA notes / collection of frequently asked interview questions / C++ notes; personal notes, updated irregularly: sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.☆47Jan 25, 2024Updated 2 years ago
- Python bindings to the PSRDada ringbuffer implementation☆11Jan 30, 2024Updated 2 years ago
- A 3D fluid simulation on the GPU using C++ and Vulkan.☆13Jun 12, 2022Updated 4 years ago
- 🐱 ncnn int8 model quantization evaluation☆14Oct 10, 2022Updated 3 years ago
- High level Gazebo simulation for the Unitree Robotics' Aliengo, A1 and Go1 quadruped robots.☆11Nov 2, 2023Updated 2 years ago
- Android course project: a WeChat clone with a simple UI and no backend integration, a version of WeChat for the elderly☆10Jan 4, 2020Updated 6 years ago
- Adaptive Topology Reconstruction for Robust Graph Representation Learning [Efficient ML Model]☆10Feb 11, 2025Updated last year
- ☆12Aug 21, 2019Updated 6 years ago
- ☆13Apr 16, 2024Updated 2 years ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆11,245May 29, 2026Updated 2 weeks ago
- 🚀Train a VLA "large model" yourself, end to end: a 26M-parameter visual multimodal VLM trained from scratch in just 1 hour!🌏☆36Oct 16, 2025Updated 7 months ago
- Work in progress object cutting based on Nvidia Flex☆15May 17, 2021Updated 5 years ago
- [ICML'25] Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting | Sample-level adaptive multi-model ensembling for time series forecasting☆29May 22, 2025Updated last year
- Parallel Prefix Sum (Scan) with CUDA☆29Jun 22, 2024Updated last year
- molecular dynamics (MD) simulation of 10^13 atoms.☆12Nov 22, 2024Updated last year
- Mitigation of periodic as well as narrow-band and spiky/bursty RFI from time-domain filterbank data.☆18Apr 23, 2021Updated 5 years ago
- ☆13May 14, 2024Updated 2 years ago
- Implementation of our paper: Komaritzan and Botsch, Fast Projective Skinning, ACM MIG 2019.☆58Jan 27, 2024Updated 2 years ago
- A large-scale training and benchmarking framework for rPPG.☆10Nov 26, 2024Updated last year
- PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis☆37Oct 27, 2025Updated 7 months ago
- A C++ port of karpathy/micrograd, a tiny scalar-valued autograd engine and a neural net library☆13Nov 24, 2023Updated 2 years ago
- ☆21May 13, 2022Updated 4 years ago
- Applications of graph neural networks in recommender systems☆13Aug 26, 2021Updated 4 years ago
- ☆23May 10, 2023Updated 3 years ago
- ☆11Jun 13, 2022Updated 4 years ago
- Multi-GPU Framework for Voxel Grid Computations☆68Mar 26, 2026Updated 2 months ago
- Implementation of analytic collision penalty eigensystems (with Matlab)☆19Oct 23, 2025Updated 7 months ago
- Unstructured computations on emerging architectures.☆17Jun 1, 2022Updated 4 years ago
- taichi hackathon repo.☆18Dec 15, 2022Updated 3 years ago
- ☆47Dec 13, 2025Updated 6 months ago
- A reference implementation of "WRAPD: Weighted Rotation-aware ADMM for Parameterization and Deformation" written in C++. This code suppor…☆13Aug 9, 2021Updated 4 years ago
- CoRdE model implementation: simulating ropes, chains, and other elastic strings☆11May 8, 2020Updated 6 years ago
- SuperTerrain+: A real-time procedural 3D infinite terrain engine with geographical features and photorealistic rendering.☆18Apr 6, 2023Updated 3 years ago
- A Winograd Minimal Filter Implementation in CUDA☆30Aug 25, 2021Updated 4 years ago
- some physics implemented on Taichi-AOT & Unity☆17Dec 4, 2022Updated 3 years ago