kuixu / Linear-Multihead-AttentionView external linksLinks
Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)
☆75Jun 23, 2020Updated 5 years ago
Alternatives and similar repositories for Linear-Multihead-Attention
Users that are interested in Linear-Multihead-Attention are comparing it to the libraries listed below
Sorting:
- Implementation for NATv2.☆23Feb 20, 2021Updated 4 years ago
- My take on a practical implementation of Linformer for Pytorch.☆422Jul 27, 2022Updated 3 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Attention mechanism☆52Sep 13, 2021Updated 4 years ago
- ☆24Nov 21, 2023Updated 2 years ago
- An autoregressive model for point cloud generation augmented with self-attention☆28Mar 20, 2020Updated 5 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆29May 22, 2022Updated 3 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Multiple Anchor Learning for Visual Object Detection (CVPR,2020)☆14Mar 18, 2021Updated 4 years ago
- ☆12Dec 23, 2019Updated 6 years ago
- This repository contains the data used for the paper "Entity Recognition at First Sight: Improving NER with Eye Movement Information" by …☆11Jan 22, 2020Updated 6 years ago
- ☆11Apr 18, 2021Updated 4 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- Implementation of Linformer for Pytorch☆305Jan 5, 2024Updated 2 years ago
- A pytorch realization of adafactor (https://arxiv.org/pdf/1804.04235.pdf )☆26Aug 27, 2019Updated 6 years ago
- pytorch实现bert做seq2seq任务,使用unilm方案。☆10Apr 1, 2020Updated 5 years ago
- add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()☆14Jan 13, 2021Updated 5 years ago
- Code repo for the paper "Semantic Correspondence via 2D-3D-2D Cycle"☆12Jan 28, 2021Updated 5 years ago
- GAN(TK)²: GAN Neural Tangent Kernel ToolKit☆13Jul 12, 2022Updated 3 years ago
- Semi-Supervised Video Object Segmentation(VOS) Paper List☆27Nov 3, 2020Updated 5 years ago
- ☆254Oct 4, 2022Updated 3 years ago
- Generalizable Semantic Segmentation viaModel-agnostic Learning and Target-specificNormalization☆28Apr 21, 2020Updated 5 years ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- Lyra: A Benchmark for Turducken-Style Code Generation☆15Apr 22, 2022Updated 3 years ago
- DeepNCM: Deep Nearest Class Mean Classifiers☆13Dec 20, 2018Updated 7 years ago
- ☆13Nov 8, 2022Updated 3 years ago
- NeurIPS 2019 Paper Implementation☆12Nov 22, 2022Updated 3 years ago
- ☆16Oct 3, 2023Updated 2 years ago
- For paper《Gaussian Transformer: A Lightweight Approach for Natural Language Inference》☆28Feb 23, 2020Updated 5 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 10 months ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Nov 29, 2021Updated 4 years ago
- (unofficial) - customized fork of DETR, optimized for intelligent obj detection on 'real world' custom datasets☆12Aug 22, 2020Updated 5 years ago
- 复现论文《Distilling Task-Specific Knowledge from BERT into Simple Neural Networks》☆16Jun 13, 2021Updated 4 years ago
- Repo of "MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds".☆39Sep 20, 2023Updated 2 years ago
- ☆196Feb 14, 2023Updated 3 years ago
- Code for RANet: Region Attention Network for Semantic Segmentation☆33May 26, 2021Updated 4 years ago
- ☆98Apr 27, 2022Updated 3 years ago
- Code for our paper "Prune and Replace NAS"☆17Jun 26, 2019Updated 6 years ago
- ☆16May 6, 2021Updated 4 years ago