sifakis / CS639S23_Demos
Software artifacts and Demos for CS639 (Spring 2023) "Parallel and Throughput-Optimized Programming"
☆18Updated 2 years ago
Alternatives and similar repositories for CS639S23_Demos
Users who are interested in CS639S23_Demos are comparing it to the libraries listed below.
- ☆28Updated 2 weeks ago
- A set of hands-on tutorials for CUDA programming☆240Updated last year
- Complete software package for the Iris Lunar Rover (CMU).☆16Updated 11 months ago
- CUDA Guide☆73Updated last year
- ☆10Updated 9 months ago
- CUDA implementation of parallel Depth First Search (DFS) algorithm and its comparison with a serial C++ DFS implementation.☆29Updated 7 years ago
- Code and data for paper "(How) do Language Models Track State?"☆19Updated 6 months ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆22Updated 8 months ago
- ☆16Updated 10 months ago
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials☆67Updated this week
- ☆145Updated 4 months ago
- Introduction to CUDA programming and debugging☆16Updated 2 years ago
- ☆22Updated 5 months ago
- Competitive GPU kernel optimization platform.☆107Updated this week
- Learning about CUDA by writing PTX code.☆143Updated last year
- ☆27Updated last year
- ☆40Updated 3 weeks ago
- ☆27Updated 3 months ago
- 📄Small Batch Size Training for Language Models☆63Updated last week
- Personal solutions to the Triton Puzzles☆20Updated last year
- The evaluation framework for training-free sparse attention in LLMs☆101Updated 3 months ago
- 6.790 | Machine Learning | Draft Site/Notes☆13Updated this week
- ☆34Updated last year
- High-Performance SGEMM on CUDA devices☆107Updated 8 months ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆39Updated last year
- Experiment of using Tangent to autodiff triton☆80Updated last year
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆119Updated 3 months ago
- Effective transpose on Hopper GPU☆25Updated last month
- Neural Optimal Transport with Lagrangian Costs☆58Updated 4 months ago
- ☆32Updated last year