ZRayZzz/flash-attention-v100

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZRayZzz/flash-attention-v100)

ZRayZzz / flash-attention-v100

☆78

Alternatives and similar repositories for flash-attention-v100

Users that are interested in flash-attention-v100 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

weishengying / tiny-flash-attention
View on GitHub
使用 cutlass 实现 flash-attention 精简版，具有教学意义
☆59Aug 12, 2024Updated last year
HuyNguyen-hust / hopper-gemm-101
View on GitHub
☆14Dec 22, 2024Updated last year
FlorianRhiem / VFRendering
View on GitHub
A vector field rendering library
☆17Jul 31, 2019Updated 6 years ago
Faraz9877 / H100_GEMM
View on GitHub
High-performance GEMM implementation optimized for NVIDIA H100 GPUs, leveraging Hopper architecture's TMA, WGMMA, and Thread Block Cluste…
☆11Dec 4, 2024Updated last year
jeremyxu2010 / toy-compiler
View on GitHub
个人学习编译原理、理解创造一个编译器主体流程的小项目
☆10Oct 7, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lvyufeng / uie_mindspore
View on GitHub
☆12Mar 21, 2024Updated 2 years ago
basnijholt / variational-quantum-monte-carlo
View on GitHub
2014: Variational Monte Carlo for the harmonic oscillator, helium, hydrogen and H2 - IPython notebook and FORTRAN90
☆13Jun 23, 2016Updated 10 years ago
alpha0422 / torch-graph
View on GitHub
Simple PyTorch graph capturing.
☆21May 31, 2023Updated 3 years ago
enp1s0 / cuMpSGEMM
View on GitHub
Fast SGEMM emulation on Tensor Cores
☆17Feb 16, 2025Updated last year
JINO-ROHIT / kernels
View on GitHub
writing really fast kernels
☆19Jul 15, 2026Updated 2 weeks ago
bohanw / jpeg_comp_verilog
View on GitHub
JPEG Compression RTL implementation
☆11Aug 19, 2017Updated 8 years ago
mazurowski-lab / segment-anything2-medical-evaluation
View on GitHub
☆13Aug 26, 2024Updated last year
regymm / mit_sd_controller_improved
View on GitHub
Improved version of http://web.mit.edu/6.111/volume2/www/f2018/tools/sd_controller.v
☆13Dec 6, 2021Updated 4 years ago
Reconfigurable-Computing / HLStoFPGA
View on GitHub
☆12Jul 20, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tlc-pack / libflash_attn
View on GitHub
Standalone Flash Attention v2 kernel without libtorch dependency
☆113Sep 10, 2024Updated last year
enochii / simple-pointer-analysis
View on GitHub
Pointer analysis prototype (currently including anderson, steensgard).
☆17Dec 20, 2021Updated 4 years ago
deciding / cutez
View on GitHub
CuTeDSL tutorials, tools, autotuner, profiler, etc.
☆41Jun 27, 2026Updated last month
Bald0Wang / llm-langchain-quick-start
View on GitHub
llm langchain quick start
☆16Jun 14, 2023Updated 3 years ago
arpitp / ssd-controller
View on GitHub
Open Source SSD Controller. NVMe and Lightstor variants
☆17May 21, 2014Updated 12 years ago
jianyicheng / HLS-benchmarks
View on GitHub
Benchmarks for High-Level Synthesis
☆11Mar 17, 2023Updated 3 years ago
regymm / ymmcu-ft2232
View on GitHub
FT2232HL JTAG & UART Downloader
☆22Jul 18, 2021Updated 5 years ago
EnricoCancelli / ProximitySocialNav
View on GitHub
repository for "Exploiting Proximity-Aware Tasks for Embodied Social Navigation" paper code
☆12Nov 16, 2023Updated 2 years ago
wenhaochai / claude-plugins
View on GitHub
Personal Claude Code plugin marketplace
☆16Jul 21, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
flagos-ai / DeepSeek-V4-FlagOS
View on GitHub
☆16Jul 18, 2026Updated last week
YuxueYang1204 / CudaDemo
View on GitHub
Implement custom operators in PyTorch with cuda/c++
☆77Jan 1, 2023Updated 3 years ago
robertkist / qtmodernredux
View on GitHub
QtModernRedux is a modern dark window and widget theme for PySide6 and PySide2
☆36May 19, 2023Updated 3 years ago
snakebang007 / nas_video_downloader
View on GitHub
一款专注于下载视频、图集的工具，支持抖音、小红书、哔哩哔哩等平台。支持部署在 docker，支持绿联云等NAS。特别推荐部署在绿联云上（因为支持 docker 远程访问）。
☆15Jul 9, 2026Updated 2 weeks ago
flagos-ai / skills
View on GitHub
FlagOS skills for model deployment, HW adaptation, train&infer, eval, kernel dev and perf tuning
☆19Jul 18, 2026Updated last week
emstoudenmire / parallelDMRG
View on GitHub
Real-space parallel density matrix renormalization group (DMRG) based on ITensor
☆20Dec 22, 2020Updated 5 years ago
fpgasystems / groundhog
View on GitHub
Groundhog - Serial ATA Host Bus Adapter
☆24Jun 10, 2018Updated 8 years ago
gty111 / GEMM_WMMA
View on GitHub
GEMM by WMMA (tensor core)
☆15Jul 31, 2022Updated 3 years ago
Karbo123 / pytorch_grouped_gemm
View on GitHub
High Performance Grouped GEMM in PyTorch
☆30May 10, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
davisyoshida / jax-gptq
View on GitHub
JAX implementation of GPTQ quantization algorithm
☆10Jul 19, 2023Updated 3 years ago
ztgao / ConvNN_FPGA_Accelerator
View on GitHub
☆14Dec 17, 2015Updated 10 years ago
igamenovoer / houmao
View on GitHub
A framework and CLI toolkit for orchestrating teams of loosely-coupled AI agents.
☆18Updated this week
GinsengHoney / CUDA_Study
View on GitHub
☆18Jul 31, 2023Updated 2 years ago
xueruini / sdcs-testsuit
View on GitHub
Simple test for SDCS
☆11Oct 3, 2025Updated 9 months ago
plctlab / riscv-cluster
View on GitHub
Towards a million-node RISC-V cluster.
☆14Mar 6, 2025Updated last year
2404589803 / hf-daily-paper-newsletter-chinese
View on GitHub
🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…
☆57Updated this week