Implement FlashAttention v2 with minimal code to learn.
☆15Jun 12, 2024Updated last year
Alternatives and similar repositories for flash-attention-v2-minimal
Users that are interested in flash-attention-v2-minimal are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flash Attention in raw Cuda C beating PyTorch☆38May 14, 2024Updated last year
- NES emulator written in pure FreeBASIC with love by Blyss Sarania and Gavin Schulte(Nobbs66).☆21Oct 29, 2025Updated 5 months ago
- a simple tools used to label the homography matrix between two image☆13Jan 13, 2023Updated 3 years ago
- ☆10Mar 14, 2018Updated 8 years ago
- ☆10Aug 10, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆16Apr 30, 2025Updated 11 months ago
- ☆11Dec 31, 2019Updated 6 years ago
- Immix GC for LLVM based languages☆15Apr 2, 2025Updated 11 months ago
- Example for baking the current git commit hash into a bazel C++ project☆11Jan 25, 2022Updated 4 years ago
- Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)☆31Jul 4, 2024Updated last year
- 🤖 Telegram chatbot frontend for Searx.☆15Nov 25, 2018Updated 7 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- ☆14Mar 21, 2026Updated last week
- A rust version of the Caffe library.☆19Jun 16, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆15Apr 28, 2023Updated 2 years ago
- ☆17Apr 9, 2025Updated 11 months ago
- ☆29Feb 20, 2024Updated 2 years ago
- E-Graph library☆22Apr 4, 2024Updated last year
- VGG16 architecture with BatchNorm☆14Apr 4, 2017Updated 8 years ago
- A Google images scraper to collect a labeled face dataset.☆11Oct 24, 2018Updated 7 years ago
- ☆13Jan 16, 2026Updated 2 months ago
- CUDA SGEMM optimization note☆15Oct 31, 2023Updated 2 years ago
- AI大模型的基本开发框架,适合普通后端程序员,功能类似coze包括:fastapi后端接口,搜索,文档解析和向量化,RPA和爬虫,自定义agent,对接第三方数据接口,mongodb数据库,控制json返回,多模态理解和生成等等☆13Jul 18, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Jul 13, 2025Updated 8 months ago
- NetHCF: Enabling Line-rate and Adaptive Spoofed IP Traffic Filtering☆13Mar 17, 2022Updated 4 years ago
- Learn how to create impactful AI Agents using Agno AI Python Package☆13Jul 31, 2025Updated 7 months ago
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- 关于深度学习算法、框架、编译器、加速器的一些理解☆16Jul 2, 2022Updated 3 years ago
- A graph coloring register allocator for LLVM.☆11Jan 23, 2017Updated 9 years ago
- Table of common WiFi router SSIDs with their corresponding router model, WPA key examples, keyspace, format and default web interface cre…☆13Mar 15, 2022Updated 4 years ago
- 【今日头条】文本作者身份识别比赛☆10Aug 20, 2018Updated 7 years ago
- Acclaim: Adaptive Memory Reclaim to Improve User Experience in Android Systems [ATC '20]☆16Aug 1, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- seeta face detection for Android☆11Sep 23, 2017Updated 8 years ago
- Boost.Align☆16Mar 11, 2026Updated 2 weeks ago
- LLM inference in C/C++☆20Oct 22, 2025Updated 5 months ago
- HMS - Harmful Brain Activity Classification☆13May 8, 2024Updated last year
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆31Mar 30, 2025Updated last year
- This repository is used to collect NeRF papers on autonomous driving☆31Apr 12, 2024Updated last year
- This repository contains the 3D face reconstruction results from a single image.☆16Jun 14, 2018Updated 7 years ago