w3hbi / Fundamentals_of_Accelerated_Computing_with_CUDA_PythonLinks
Practice exercises and assessments for NVIDIA DLI's "Fundamentals of Accelerated Computing with CUDA Python" course.
☆23Updated 2 years ago
Alternatives and similar repositories for Fundamentals_of_Accelerated_Computing_with_CUDA_Python
Users that are interested in Fundamentals_of_Accelerated_Computing_with_CUDA_Python are comparing it to the libraries listed below
Sorting:
- GPU Kernels☆210Updated 7 months ago
- Notes on quantization in neural networks☆113Updated 2 years ago
- 100 days of building GPU kernels!☆552Updated 7 months ago
- ☆74Updated last year
- ☆24Updated 11 months ago
- coding CUDA everyday!☆71Updated last week
- ☆113Updated last week
- making the official triton tutorials actually comprehensible☆82Updated 3 months ago
- 1st Place Team Crane: @aswinkumar1999 @rathull @kyolebu☆29Updated 3 months ago
- ☆404Updated 8 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆395Updated last month
- ☆46Updated 8 months ago
- A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation an…☆19Updated 2 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆441Updated 9 months ago
- ☆69Updated 3 weeks ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆214Updated this week
- Cross Beat (xbe.at) - Your hub for python, machine learning and AI tutorials. Explore Python tutorials, AI insights, and more.☆523Updated 3 months ago
- Implementation of a methodology that allows all sorts of user defined GPU kernel fusion, for non CUDA programmers.☆32Updated this week
- Some CUDA example code with READMEs.☆179Updated last month
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month
- E2E AutoML Model Compression Package☆46Updated 9 months ago
- ☆89Updated 8 months ago
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Updated 6 months ago
- ☆227Updated 11 months ago
- Improving AI Systems with Self-Defense Mechanisms☆22Updated 9 months ago
- documentation for content creation☆231Updated 2 months ago
- Learning to Skip the Middle Layers of Transformers☆15Updated 4 months ago
- Transmute AI Lab Model Efficiency Toolkit☆19Updated 2 years ago
- ☆59Updated 2 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆427Updated 9 months ago