Fast and memory-efficient exact attention
☆15Feb 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for flash-attention
Users who are interested in flash-attention are comparing it to the libraries listed below
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆15Oct 13, 2023Updated 2 years ago
- arok ui☆10Jun 20, 2023Updated 2 years ago
- Seizure prediction using EEG data from CHB MIT dataset using modern Deep Learning techniques. It will be organized and easy to replicate.☆10Apr 3, 2023Updated 2 years ago
- The code of paper 'Dual Relation Knowledge Distillation for Object Detection' IJCAI 2023☆15Feb 19, 2024Updated 2 years ago
- Real time high precision network monitor☆10Feb 24, 2019Updated 7 years ago
- official repository for ‘ITportrait: Image-Text Coupled 3D Portrait Domain Adaptation (ICME 2024)’☆16Mar 14, 2024Updated last year
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- ☆16Nov 21, 2025Updated 3 months ago
- RenderToy2 is the Monte-Carlo-estimated PBR final project for Advanced Computer Graphics (Fall 2023, IIIS, Tsinghua University). Course g…☆12Dec 13, 2023Updated 2 years ago
- ☆13May 25, 2022Updated 3 years ago
- A GitHub Action to run a cpplint command when new code is pushed into your repo☆11Mar 24, 2025Updated 11 months ago
- Confidence driven image fusion based on TGV regularization☆11Dec 24, 2017Updated 8 years ago
- The normative reference documentation for the Slang programming language.☆16Feb 19, 2026Updated last week
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- nethserver-arm issue tracker☆10Sep 25, 2021Updated 4 years ago
- A conversion tool for converting OpenVDB files to NanoVDB files☆10Nov 25, 2022Updated 3 years ago
- Documentation for RELION☆14Nov 12, 2025Updated 3 months ago
- Fork of LLVM Project containing a Colossus IPU backend implementation☆13Feb 2, 2026Updated last month
- Simple and fast deployment of deep learning models☆13Sep 3, 2023Updated 2 years ago
- Notes on lock-free queues☆11Feb 13, 2017Updated 9 years ago
- Code to reproduce the experiments described in "Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Dupl…☆14Oct 16, 2023Updated 2 years ago
- Just a template for quickly creating a python library.☆10Feb 8, 2026Updated 3 weeks ago
- This is the official Mitsuba3 implementation of "Doppler Time-of-Flight Rendering" (SIGGRAPH Asia 2023)☆18Jul 8, 2024Updated last year
- A C++ library for parsing HLS playlists and demuxing Transport Streams☆11Aug 23, 2022Updated 3 years ago
- ☆11Apr 5, 2021Updated 4 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- Data Files for "Deep diversification of an AAV capsid protein by machine learning"☆18Mar 9, 2021Updated 4 years ago
- LiDAR SiMulator v2: Interactive 2D LiDAR scanner simulator version II. Implemented via Rust and CUDA.☆13Jul 18, 2023Updated 2 years ago
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…☆19Jan 22, 2026Updated last month
- A simple Python implementation of the n-gram sunburst (nested pie chart) visualization shown in the CoQA paper☆13Mar 12, 2019Updated 6 years ago
- A conda-smithy repository for ambertools.☆11Feb 26, 2025Updated last year
- ☆11Nov 17, 2022Updated 3 years ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 3 months ago
- ☆11Jun 13, 2025Updated 8 months ago
- ☆12Jan 7, 2023Updated 3 years ago
- Simple Arm assembly kernels for testing the performance and functionality of Arm CPUs.☆13Dec 3, 2023Updated 2 years ago
- variant type for CUDA☆12Nov 14, 2015Updated 10 years ago
- Simple OpenGL-based renderer for volumetric (medical) data☆16Jun 4, 2025Updated 8 months ago