CUDA & Triton Learning Project: Exploring Flash Attention Implementations
☆29Aug 14, 2025Updated 7 months ago
Alternatives and similar repositories for cuda-triton-learning
Users that are interested in cuda-triton-learning are comparing it to the libraries listed below.
- ☆20Jan 20, 2026Updated 2 months ago
- Basic implementation of the Reduced Immersed Method [Brandt, Scandolo, Eisemann, Hildebrandt 2019]☆12Aug 6, 2020Updated 5 years ago
- ☆10Nov 26, 2023Updated 2 years ago
- HIP backend patch for Numba, the NumPy aware dynamic Python compiler using LLVM.☆19Feb 16, 2026Updated last month
- Awesome code, projects, books, etc. related to CUDA☆31Feb 3, 2026Updated last month
- 2024 Renmin University of China, Programming II honors course final project: Surakarta (board game)☆21Jun 10, 2024Updated last year
- List of project ideas for contributors applying to the Google Summer of Code program in 2026 (GSoC 2026).☆26Feb 26, 2026Updated 3 weeks ago
- ☆10Mar 3, 2024Updated 2 years ago
- Shared study room system☆17Jun 22, 2021Updated 4 years ago
- Arduino menu library for LCD displays 16x2 and 20x4☆12Sep 9, 2021Updated 4 years ago
- Official codebase for the Siggraph Asia 2025 paper AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry☆48Feb 23, 2026Updated last month
- High Performance Search Platform☆32Aug 1, 2015Updated 10 years ago
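- Introduction to Artificial Intelligence (honors class), second semester of the 2024-2025 academic year☆17Jun 16, 2025Updated 9 months ago
- Arduino programming with the third-party 点灯科技 platform, developed on the ESP8266, as a way into the IoT!☆10May 13, 2020Updated 5 years ago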
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network☆15Oct 18, 2022Updated 3 years ago
- A runnable `Skill-first + Vector-augmented + LangGraph` RAG system with support for multiple model providers, hierarchical memory, and a web chat interface.☆47Updated this week
- Python bindings for fast_cd☆28Nov 29, 2023Updated 2 years ago
- A Python implementation of shooting and bouncing rays (PO-SBR), accelerated using OptiX.☆23Aug 13, 2025Updated 7 months ago
- Some interesting books in PDF format☆20Apr 26, 2019Updated 6 years ago
- Wicked fast Conditional Random Fields for Ruby☆37Jan 2, 2023Updated 3 years ago
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.☆13Mar 20, 2025Updated last year
- diffusers with search engine☆12Jan 13, 2026Updated 2 months ago
- MATLAB-based simulation of a direct-sequence spread spectrum communication system☆15Apr 15, 2024Updated last year
- ☆14Mar 8, 2025Updated last year
- Framework for Algorithmic Correctness Testing of Operators☆16Mar 9, 2026Updated 2 weeks ago
- A physics engine to simulate free-surface water☆31Aug 15, 2020Updated 5 years ago
- ☆12Mar 7, 2024Updated 2 years ago
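- Based on RaytrAMP; uses CUDA to calculate MonoRCS with the shooting and bouncing rays method☆25Nov 18, 2019Updated 6 years ago
- A toy mini-program implementation, intended only for practicing and learning how mini-programs are implemented; it has no practical use for now.☆11Mar 15, 2019Updated 7 years ago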
- Accelerating LLM inference with techniques like speculative decoding, quantization, and kernel fusion, focusing on implementing state-of-…☆11Jul 1, 2025Updated 8 months ago
- Thousands of machine learning projects covering all scenarios: getting started, improvement, graduation projects, and job interviews.☆45Updated this week
- https://bbuf.github.io/gpu-glossary-zh/☆26Nov 7, 2025Updated 4 months ago
- My attempt to improve the speed of the Newton-Schulz algorithm, starting from the Dion implementation.☆33Dec 5, 2025Updated 3 months ago
- An independent implementation to reproduce the paper "RGB-Infrared Cross-Modality Person Re-Identification" from ICCV2017☆27Jan 4, 2020Updated 6 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- ☆18Nov 11, 2025Updated 4 months ago
- PyTorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 5 months ago
- ☆26Aug 28, 2024Updated last year
- ☆12Jan 19, 2020Updated 6 years ago