Fast and memory-efficient exact attention
☆19Mar 9, 2026Updated 2 weeks ago
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆15Oct 13, 2023Updated 2 years ago
- ☆17May 14, 2024Updated last year
- ☆11Nov 17, 2022Updated 3 years ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆17Mar 26, 2025Updated 11 months ago
- ☆17Feb 5, 2023Updated 3 years ago
- Real time high precision network monitor☆10Feb 24, 2019Updated 7 years ago
- A tool to convert ADE20K to COCO format based on desired objects☆15Jun 10, 2024Updated last year
- arok ui☆10Jun 20, 2023Updated 2 years ago
- A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper☆13Mar 12, 2019Updated 7 years ago
- ☆11Apr 5, 2021Updated 4 years ago
- A C++ library for parsing and HLS playlists and demuxing Transport Streams☆11Aug 23, 2022Updated 3 years ago
- [NeurIPS 2025 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭] AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆40Dec 26, 2025Updated 2 months ago
- 演讲《GPU 驱动的发行版适配》☆11Mar 18, 2024Updated 2 years ago
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- SOTA benchmark☆18Aug 8, 2023Updated 2 years ago
- CORE: Automatic Molecule Optimization using Copy & Refine Strategy (AAAI 2020)☆17Jul 17, 2023Updated 2 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆12Jun 12, 2023Updated 2 years ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆25Feb 18, 2025Updated last year
- A GitHub Action to run a cpplint command when new code is pushed into your repo☆11Mar 24, 2025Updated last year
- ncnn android robust video matting☆22Jan 14, 2026Updated 2 months ago
- nethserver-arm issue tracker☆10Sep 25, 2021Updated 4 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- ☆14May 25, 2022Updated 3 years ago
- 关于无锁队列的知识☆11Feb 13, 2017Updated 9 years ago
- ☆14Jul 21, 2022Updated 3 years ago
- A conda-smithy repository for ambertools.☆11Updated this week
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- ☆13Jun 18, 2023Updated 2 years ago
- Simple Arm assembly kernels for testing the performance and functionality of Arm CPUs.☆14Dec 3, 2023Updated 2 years ago
- 图像自动标注☆28May 7, 2018Updated 7 years ago
- Fork of LLVM Project containing a Colossus IPU backend implementation☆13Mar 11, 2026Updated last week
- useful text recognition algorithms, CRNN and SVTR text recognition☆29Feb 10, 2023Updated 3 years ago
- Just a template for quickly creating a python library.☆10Mar 16, 2026Updated last week
- Run cpplint with reviewdog☆13Jan 25, 2026Updated last month
- GAMES101-现代计算机图形学入门-闫令琪☆11Mar 6, 2021Updated 5 years ago
- ☆13Nov 25, 2021Updated 4 years ago
- Documentations for RELION☆14Mar 13, 2026Updated last week
- PyTorch implementation of the paper: Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. Su Zhu, Ruish…☆18Nov 10, 2021Updated 4 years ago
- 学习 Flutter 路上的点滴及小测~☆18Jun 24, 2024Updated last year