Fast and memory-efficient exact attention
☆29 · Dec 2, 2024 · Updated last year
Alternatives and similar repositories for flash-attention-3
Users who are interested in flash-attention-3 are comparing it to the libraries listed below.
- Basic world models ☆31 · Oct 30, 2025 · Updated 4 months ago
- ☆23 · Jun 18, 2024 · Updated last year
- LLM checkpointing for DeepSpeed/Megatron ☆25 · Nov 30, 2025 · Updated 3 months ago
- ☆27 · May 3, 2024 · Updated last year
- ☆34 · Sep 10, 2024 · Updated last year
- MCP server for Google search and page fetching using headless Chromium ☆67 · Feb 21, 2026 · Updated last week
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆43 · Nov 9, 2023 · Updated 2 years ago
- an autonomous independent digital companion ☆14 · Feb 12, 2026 · Updated 2 weeks ago
- Extract streaming data from text using prefix completion. ☆10 · Oct 6, 2024 · Updated last year
- RADIX-4 SRT division ☆12 · Oct 31, 2019 · Updated 6 years ago
- Cookiecutter template for making a cog for Red. ☆12 · Jun 18, 2024 · Updated last year
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs ☆15 · Feb 10, 2026 · Updated 2 weeks ago
- Discord Docsbot, Built on bgent ☆11 · Jun 17, 2024 · Updated last year
- Hi, I'm Harmony the Hummingbird! Let's work together on whatever you care about. ☆12 · May 3, 2024 · Updated last year
- a suite of finetuned LLMs for atomically precise function calling 🧪 ☆17 · Feb 6, 2026 · Updated 3 weeks ago
- JPEG codec implemented from scratch (Python JPEG codec) ☆10 · Jul 29, 2022 · Updated 3 years ago
- openapi-documented arcgis proxy & geospatial data discovery server ☆15 · Dec 15, 2025 · Updated 2 months ago
- ☆12 · May 20, 2025 · Updated 9 months ago
- A simple script to add pdf-files to Zotero via CLI ☆12 · May 17, 2020 · Updated 5 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention ☆12 · May 24, 2023 · Updated 2 years ago
- record your keypress counts into a database to track computer usage ☆12 · Dec 25, 2025 · Updated 2 months ago
- A Next.js v15+ template with Tailwind v3+, featuring Microsoft Entra ID authentication via Next-Auth v5+ and a Microsoft Graph Client int… ☆10 · Jan 28, 2026 · Updated last month
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed ☆25 · Jul 26, 2025 · Updated 7 months ago
- Basic floating-point components for RISC-V processors ☆11 · Aug 13, 2017 · Updated 8 years ago
- Stable Diffusion Documents ☆11 · Aug 22, 2023 · Updated 2 years ago
- ☆48 · Feb 23, 2025 · Updated last year
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs. ☆14 · Aug 8, 2025 · Updated 6 months ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?" ☆29 · Jun 25, 2025 · Updated 8 months ago
- ☆10 · Jun 4, 2024 · Updated last year
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning… ☆11 · Feb 6, 2023 · Updated 3 years ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image" ☆10 · Dec 9, 2023 · Updated 2 years ago
- Hardware Division Units ☆10 · Jul 17, 2014 · Updated 11 years ago
- Improving transparency of large language models' reasoning ☆14 · Nov 25, 2025 · Updated 3 months ago
- Chinese translation and study notes for "Causality: Models, Reasoning, and Inference" by Judea Pearl (2009) ☆15 · Feb 18, 2022 · Updated 4 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models ☆11 · Jan 23, 2024 · Updated 2 years ago
- Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more ☆12 · Feb 16, 2026 · Updated 2 weeks ago
- The Ultimate OpenCode Starter Kit. Includes Oh My OpenCode config, Superpowers installation fix, MCP Setup, and Windows Crash Fix (exit_c… ☆17 · Feb 10, 2026 · Updated 2 weeks ago
- ☆13 · Jun 3, 2024 · Updated last year
- Advanced Video Graph RAG using SAM2, CLIP, BLIP, Qwen2-VL, YOLO-World, Neo4j, WebGPU, local LLM ☆14 · Nov 25, 2024 · Updated last year