Implement FlashAttention v2 with minimal code to learn.
☆16Jun 12, 2024Updated 2 years ago
Alternatives and similar repositories for flash-attention-v2-minimal
Users that are interested in flash-attention-v2-minimal are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flash Attention in raw Cuda C beating PyTorch☆39May 14, 2024Updated 2 years ago
- NES emulator written in pure FreeBASIC with love by Blyss Sarania and Gavin Schulte(Nobbs66).☆21Oct 29, 2025Updated 7 months ago
- a simple tools used to label the homography matrix between two image☆13Jan 13, 2023Updated 3 years ago
- ☆10Mar 14, 2018Updated 8 years ago
- ☆10Aug 10, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Apr 14, 2026Updated 2 months ago
- ☆18Apr 30, 2025Updated last year
- Example for baking the current git commit hash into a bazel C++ project☆11Jan 25, 2022Updated 4 years ago
- Immix GC for LLVM based languages☆17Apr 2, 2025Updated last year
- 🤖 Telegram chatbot frontend for Searx.☆16Nov 25, 2018Updated 7 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)☆33Jul 4, 2024Updated last year
- A rust version of the Caffe library.☆19Jun 16, 2021Updated 5 years ago
- ☆17Apr 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Apr 28, 2023Updated 3 years ago
- ☆30Feb 20, 2024Updated 2 years ago
- E-Graph library☆23Apr 4, 2024Updated 2 years ago
- VGG16 architecture with BatchNorm☆14Apr 4, 2017Updated 9 years ago
- A Google images scraper to collect a labeled face dataset.☆11Oct 24, 2018Updated 7 years ago
- CUDA SGEMM optimization note☆15Oct 31, 2023Updated 2 years ago
- AI大模型的基本开发框架,适合普通后端程序员,功能类似coze包括:fastapi后端接口,搜索,文档解析和向量化,RPA和爬虫,自定义agent,对接第三方数据接口,mongodb数据库,控制json返回,多模态理解和生成等等☆13Jul 18, 2024Updated last year
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆21Jul 13, 2025Updated 11 months ago
- A deep learning approach to improve the resolution of images☆10Mar 18, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- NetHCF: Enabling Line-rate and Adaptive Spoofed IP Traffic Filtering☆13Mar 17, 2022Updated 4 years ago
- Learn how to create impactful AI Agents using Agno AI Python Package☆13Jul 31, 2025Updated 10 months ago
- ☆15May 3, 2026Updated last month
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- 关于深度学习算法、框架、编译器、加速器的一些理解☆16Jul 2, 2022Updated 3 years ago
- A graph coloring register allocator for LLVM.☆11Jan 23, 2017Updated 9 years ago
- Table of common WiFi router SSIDs with their corresponding router model, WPA key examples, keyspace, format and default web interface cre…☆13Mar 15, 2022Updated 4 years ago
- 【今日头条】文本作者身份识别比赛☆10Aug 20, 2018Updated 7 years ago
- Acclaim: Adaptive Memory Reclaim to Improve User Experience in Android Systems [ATC '20]☆16Aug 1, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- seeta face detection for Android☆11Sep 23, 2017Updated 8 years ago
- Boost.Align☆16Apr 22, 2026Updated last month
- LLM inference in C/C++☆21Oct 22, 2025Updated 7 months ago
- HMS - Harmful Brain Activity Classification☆13May 8, 2024Updated 2 years ago
- This repository is used to collect NeRF papers on autonomous driving☆31Apr 12, 2024Updated 2 years ago
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆33Mar 30, 2025Updated last year
- This repository contains the 3D face reconstruction results from a single image.☆16Jun 14, 2018Updated 8 years ago