Complete solutions to the Programming Massively Parallel Processors Edition 4
☆690Jun 18, 2025Updated 9 months ago
Alternatives and similar repositories for pmpp
Users that are interested in pmpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 18, 2025Updated 10 months ago
- Learn CUDA with PyTorch☆253Mar 14, 2026Updated last week
- ☆18Aug 20, 2025Updated 7 months ago
- Material for gpu-mode lectures☆5,865Feb 1, 2026Updated last month
- This comprehensive learning repository is designed to transform software engineers into expert AI kernel developers, focusing on the cutt…☆51Mar 19, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- GPU programming related news and material links☆2,060Mar 8, 2026Updated 2 weeks ago
- Learnings and programs related to CUDA☆435Jun 29, 2025Updated 8 months ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆34Jul 8, 2025Updated 8 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- ☆3,401Mar 11, 2026Updated 2 weeks ago
- Step by step implementation of a fast softmax kernel in CUDA☆63Jan 6, 2025Updated last year
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB)☆12Mar 17, 2025Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆876Mar 29, 2025Updated 11 months ago
- ☆93Nov 11, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆16Jul 16, 2024Updated last year
- ABLATE is a UB CHREST project focused on leveraging advances in both exascale computing and machine learning to better understand the tur…☆12May 7, 2025Updated 10 months ago
- GPU Engineering for AI Systems☆271Oct 26, 2025Updated 5 months ago
- ☆29Nov 9, 2025Updated 4 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆27Jan 14, 2025Updated last year
- llama2 inference engine in Rust☆13Apr 12, 2024Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,119Aug 26, 2025Updated 7 months ago
- ☆25Nov 10, 2025Updated 4 months ago
- RWKV-based Text-to-Speech implementation in Rust☆26Oct 14, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- GPU Kernels☆222Apr 27, 2025Updated 11 months ago
- ☆16Aug 7, 2021Updated 4 years ago
- ☆91Feb 29, 2024Updated 2 years ago
- OpenMOSS presents a collection of our research on LLMs, supported by SII, Fudan and Mosi.☆28Jul 24, 2025Updated 8 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆440Feb 22, 2025Updated last year
- ☆46Mar 31, 2025Updated 11 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year
- Puzzles for learning Triton☆2,348Mar 18, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆98Nov 17, 2024Updated last year
- ☆418Apr 10, 2025Updated 11 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- Deep memory and sequence models in JAX☆23Jan 15, 2026Updated 2 months ago
- 100 days of CUDA Challenge☆49Aug 2, 2025Updated 7 months ago
- ☆22May 5, 2025Updated 10 months ago
- Datomic MCP Server so your AI model can query your database (uses Modex MCP library)☆26Apr 5, 2025Updated 11 months ago