several types of attention modules written in PyTorch for learning purposes
☆53 · Jan 2, 2026 · Updated 2 months ago
Alternatives and similar repositories for attention
Users who are interested in attention are comparing it to the repositories listed below.
- 🧠 A study guide to learn about Transformers ☆12 · Jan 11, 2024 · Updated 2 years ago
- sigma-MoE layer ☆21 · Jan 5, 2024 · Updated 2 years ago
- ☆16 · Nov 4, 2023 · Updated 2 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention ☆16 · Aug 23, 2023 · Updated 2 years ago
- ☆19 · Sep 15, 2022 · Updated 3 years ago
- Spatial Spectral Machine Learning ☆14 · Oct 15, 2025 · Updated 4 months ago
- Re-implementation of Memory Networks (MemNN) paper of Facebook AI Research Lab. ☆16 · May 6, 2020 · Updated 5 years ago
- ☆21 · Oct 12, 2022 · Updated 3 years ago
- ☆22 · Nov 9, 2024 · Updated last year
- Official PyTorch code for HILA ☆28 · Nov 1, 2022 · Updated 3 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Jul 12, 2023 · Updated 2 years ago
- A PyTorch implementation of the Compact Multi-Head Self-Attention Mechanism from the paper: "Low Rank Factorization for Compact Multi-Hea… ☆26 · Jan 13, 2020 · Updated 6 years ago
- Classification models 1D Zoo - Keras and TF.Keras ☆28 · Jul 18, 2024 · Updated last year
- Tensorflow 2.0 Implementation of GCViT: Global Context Vision Transformer ☆27 · Dec 24, 2023 · Updated 2 years ago
- Repository for discussion of OpenSAFELY codelists ☆10 · Sep 10, 2024 · Updated last year
- ZYN: Zero-Shot Reward Models with Yes-No Questions ☆35 · Aug 15, 2023 · Updated 2 years ago
- ☆14 · Jun 24, 2024 · Updated last year
- This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package. ☆30 · Jul 25, 2022 · Updated 3 years ago
- PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation ☆33 · Dec 29, 2021 · Updated 4 years ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" ☆88 · Sep 12, 2025 · Updated 5 months ago
- ☆12 · May 24, 2023 · Updated 2 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch