Fast and memory-efficient exact attention
☆31Dec 2, 2024Updated last year
Alternatives and similar repositories for flash-attention-3
Users that are interested in flash-attention-3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Basic world models☆32Oct 30, 2025Updated 6 months ago
- CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation☆51Apr 9, 2026Updated 3 weeks ago
- SGLang Kernel Wheel Index☆22Updated this week
- ☆24Jun 18, 2024Updated last year
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The entire open source TokenRing ecosystem☆17Apr 22, 2026Updated last week
- ☆27May 3, 2024Updated last year
- ☆13Apr 25, 2026Updated last week
- ☆13Jan 15, 2023Updated 3 years ago
- A rust version of the Caffe library.☆19Jun 16, 2021Updated 4 years ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!☆16Aug 24, 2024Updated last year
- Joint image and Depth inpainting, ldm3d☆16Apr 28, 2024Updated 2 years ago
- Teaching materials for improving research software writing abilities.☆13Apr 16, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Hardware Division Units☆10Jul 17, 2014Updated 11 years ago
- Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization.☆60Mar 16, 2026Updated last month
- CUDA SGEMM optimization note☆15Oct 31, 2023Updated 2 years ago
- RADIX-4 SRT division☆12Oct 31, 2019Updated 6 years ago
- KSimply: An AI Potential Analyzer that recommends open-source models based on user hardware. / Un analizzatore di potenziale AI che consi…☆19Mar 18, 2026Updated last month
- Make-A-Video Latent Diffusion Model☆19Nov 15, 2023Updated 2 years ago
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- ☆25Updated this week
- Implementation of a holodeck, written in Pytorch☆19Nov 1, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Next.js v15+ template with Tailwind v3+, featuring Microsoft Entra ID authentication via Next-Auth v5+ and a Microsoft Graph Client int…☆10Mar 21, 2026Updated last month
- ☆21Jun 26, 2023Updated 2 years ago
- Triton implementation of Flash Attention2.0☆52Jul 31, 2023Updated 2 years ago
- Base for the Viral Genomics and Bioinformatics Repository☆17Jul 5, 2024Updated last year
- A fast, lightweight, and extensible RWKV chat UI powered by Flutter. Offline-ready, multi-backend support, ideal for local RWKV inference…☆91Updated this week
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- ☆15Jun 5, 2023Updated 2 years ago
- ☆16Oct 20, 2025Updated 6 months ago
- NoC based MPSoC☆11Jul 17, 2014Updated 11 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A PyTorch implementation of [VCT](https://github.com/google-research/google-research/tree/master/vct)☆10Nov 25, 2022Updated 3 years ago
- Basic floating-point components for RISC-V processors☆12Aug 13, 2017Updated 8 years ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆13May 9, 2022Updated 3 years ago
- A implement of run-length encoding for Pytorch tensor using CUDA☆14Apr 7, 2021Updated 5 years ago
- Associative scan package for DRYing some code between repos☆18Jan 5, 2026Updated 3 months ago
- PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting [CVPPA: ICCVW 2025]☆34Nov 1, 2025Updated 6 months ago
- unsigned Radix-2 SRT division,基2除法☆16May 12, 2015Updated 10 years ago