zhangtianhong-1998 / Cuda_learnLinks
这是一个从零学习CUDA课程
☆13Updated 11 months ago
Alternatives and similar repositories for Cuda_learn
Users that are interested in Cuda_learn are comparing it to the libraries listed below
Sorting:
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆14Updated last month
- ☆50Updated last month
- ☆27Updated 2 months ago
- 💩里淘金☆26Updated last month
- 这个项目介绍了简单的CUDA入门,涉及到CUDA执行模型、线程层次、CUDA内存模型、核函数的编写方式以及PyTorch使用CUDA扩展的两种方式。通过该项目可以基本入门基于PyTorch的CUDA扩展的开发方式。☆94Updated 3 years ago
- Repository for the CoRL 2024 paper: Cloth-Splatting: 3D Cloth State Estimation from RGB Supervision.☆31Updated 10 months ago
- The first open-source system for large-scale scene reconstruction training and rendering.☆55Updated last year
- An awesome 3DGS models library☆17Updated last year
- Awesome code, projects, books, etc. related to CUDA☆24Updated last month
- cuda编程学习入门☆36Updated last year
- CPU Memory Compiler and Parallel programing☆26Updated 10 months ago
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention☆186Updated last month
- the CPU implementation of bucket based farthest point sampling, achieves 7-81x speedup than the conventional implementation☆24Updated 2 years ago
- CUDA implementation of Marching Cubes for Python (Depends on torch)☆79Updated 7 months ago
- [NeurIPS 2024] Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features☆21Updated 6 months ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆28Updated last year
- Implement custom operators in PyTorch with cuda/c++☆71Updated 2 years ago
- ☆13Updated last week
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆27Updated last week
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆35Updated 2 weeks ago
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆23Updated this week
- ☆39Updated last year
- A minimal, easy-to-read PyTorch reimplementation of the Qwen3 and Qwen2.5 VL with a fancy CLI☆163Updated last month
- ☆18Updated 2 years ago
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆52Updated this week
- Automatically hold idle GPU.☆78Updated last month
- 3D Gaussian Splatting (3DGS) extension for Omniverse☆64Updated 2 months ago
- ☆41Updated 4 months ago
- ☆15Updated 9 months ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆122Updated last year