aryagxr / cuda
coding CUDA every day!
☆73 · Feb 5, 2026 · Updated last week
Alternatives and similar repositories for cuda
Users who are interested in cuda are comparing it to the libraries listed below
- CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark · ☆34 · Jun 24, 2025 · Updated 7 months ago
- IJMLC: Open-TI: Open Traffic Intelligence with Augmented Language Model · ☆22 · Jul 30, 2025 · Updated 6 months ago
- A Heterogeneous GPU Platform for Chipyard SoC · ☆42 · Updated this week
- ☆46 · May 20, 2025 · Updated 8 months ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221 · ☆30 · Apr 22, 2025 · Updated 9 months ago
- Pipeline Parallelism Emulation and Visualization · ☆77 · Jan 8, 2026 · Updated last month
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. · ☆457 · Mar 10, 2025 · Updated 11 months ago
- Best Movie App with Ionic 4 using The Movie DB API · ☆16 · May 24, 2019 · Updated 6 years ago
- ☆10 · Apr 26, 2023 · Updated 2 years ago
- extensible collectives library in triton · ☆95 · Mar 31, 2025 · Updated 10 months ago
- ☆130 · Aug 18, 2025 · Updated 5 months ago
- WaferLLM: Large Language Model Inference at Wafer Scale · ☆88 · Jan 7, 2026 · Updated last month
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer · ☆163 · Updated this week
- A ratatui based vertical and horizontal slider. · ☆35 · Jan 7, 2026 · Updated last month
- my first ever browser game · ☆10 · Jun 21, 2025 · Updated 7 months ago
- A Multi-graph Multi-head Adaptive Temporal Graph Convolutional Network · ☆11 · May 21, 2023 · Updated 2 years ago
- ☆10 · Sep 4, 2025 · Updated 5 months ago
- ☆11 · Sep 2, 2024 · Updated last year
- ☆12 · Oct 29, 2024 · Updated last year
- Flash-Muon: An Efficient Implementation of Muon Optimizer · ☆233 · Jun 15, 2025 · Updated 7 months ago
- Pure C inference for the GTE Small embedding model · ☆101 · Jan 21, 2026 · Updated 3 weeks ago
- Toolchain built around the Megatron-LM for Distributed Training · ☆86 · Dec 7, 2025 · Updated 2 months ago
- Eclipse plugin for CppUTest unit test harness · ☆19 · Jul 26, 2022 · Updated 3 years ago
- ☆28 · Dec 15, 2025 · Updated last month
- ☆14 · Dec 14, 2025 · Updated last month
- Quantization of LLMs and benchmarking. · ☆10 · Apr 3, 2024 · Updated last year
- ☆12 · Dec 20, 2024 · Updated last year
- Ver 1.1 by Mike Tucker / published by CreativeApplications.Net · ☆17 · Jul 12, 2011 · Updated 14 years ago
- Generate Linux Perf event tables for Apple Silicon · ☆17 · Dec 16, 2025 · Updated last month
- The official baseline implementations for Chronocept · ☆10 · Dec 21, 2025 · Updated last month
- A reproduction of the libsmctrl paper, with an added Python interface so that compute resources can be allocated flexibly from Python · ☆12 · May 21, 2024 · Updated last year
- A web app built with Nuxt.js that provides real-time preview of Markdown content. Compose and format Markdown text while seeing instant v… · ☆10 · Jul 3, 2023 · Updated 2 years ago
- Nankai University College of Cyber Science: Principles of Computer Organization, Spring 2023 · ☆13 · Jan 22, 2024 · Updated 2 years ago
- ☆11 · Jan 19, 2024 · Updated 2 years ago
- Official code repository for the paper titled "Efficient Molecular Conformer Generation with SO(3) Averaged Flow-Matching and Reflow" (IC… · ☆13 · Jan 8, 2026 · Updated last month
- ☆11 · Mar 13, 2024 · Updated last year
- ☆11 · Nov 30, 2023 · Updated 2 years ago
- A simple showstart script · ☆11 · May 6, 2024 · Updated last year
- Open-source keyboard firmware for Atmel AVR and Arm USB families · ☆14 · Jul 30, 2024 · Updated last year