HKUNLP/efficient-attention

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HKUNLP/efficient-attention)

HKUNLP / efficient-attention

[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling

☆86

Alternatives and similar repositories for efficient-attention

Users that are interested in efficient-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Shark-NLP / EVALM
View on GitHub
Official codebase for “In-Context Learning with Many Demonstration Examples”
☆16Feb 13, 2023Updated 3 years ago
LZhengisme / self-infilling
View on GitHub
[ICML 2024] Self-Infilling Code Generation
☆18May 5, 2024Updated 2 years ago
LZhengisme / CODA
View on GitHub
Implementation of Cascaded Head-colliding Attention (ACL'2021)
☆11Sep 16, 2021Updated 4 years ago
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
HKUNLP / subgoal-theorem-prover
View on GitHub
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆20May 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HKUNLP / hkunlp.github.io
View on GitHub
Website for HKU NLP group (under construction)
☆14Jul 6, 2026Updated 2 weeks ago
emorynlp / seq2seq-corenlp
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
Shark-NLP / CAB
View on GitHub
☆31Jul 2, 2023Updated 3 years ago
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
Noahs-ARK / RFA
View on GitHub
☆33Apr 12, 2021Updated 5 years ago
HKUNLP / reparam-discrete-diffusion
View on GitHub
Reparameterized Discrete Diffusion Models for Text Generation
☆108Feb 14, 2023Updated 3 years ago
hyp1231 / ICLR2023-OpenReviewData
View on GitHub
Crawl & visualize ICLR papers and reviews.
☆18Nov 5, 2022Updated 3 years ago
jungokasai / T2R
View on GitHub
☆14Nov 20, 2022Updated 3 years ago
facebookresearch / mbr-exec
View on GitHub
code for "Natural Language to Code Translation with Execution"
☆41Nov 2, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zsLin177 / SRL-as-GP
View on GitHub
☆18Mar 10, 2023Updated 3 years ago
VITA-Group / Data-Efficient-Scaling
View on GitHub
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
☆14Jan 4, 2024Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
zhao-ht / ConvexCertify
View on GitHub
This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022
☆11Dec 6, 2022Updated 3 years ago
robert-lieck / RBN
View on GitHub
Recursive Bayesian Networks
☆11May 11, 2025Updated last year
yikangshen / megablocks
View on GitHub
☆20May 30, 2024Updated 2 years ago
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
ictnlp / HMT
View on GitHub
Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"
☆24Dec 11, 2023Updated 2 years ago
VPeterV / RankSpace-Models
View on GitHub
source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"
☆10Sep 26, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HKUNLP / RSA
View on GitHub
Retrieved Sequence Augmentation for Protein Representation Learning
☆52Nov 1, 2023Updated 2 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
Shark-NLP / CoNT
View on GitHub
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆152May 10, 2023Updated 3 years ago
AndPotap / einsum-search
View on GitHub
☆34Oct 4, 2024Updated last year
iesl / s-diora
View on GitHub
☆12Jan 29, 2021Updated 5 years ago
HKUNLP / SymGen
View on GitHub
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
☆18Oct 21, 2023Updated 2 years ago
Timothyxxx / NeuralSymbolicPapers
View on GitHub
☆14Aug 18, 2022Updated 3 years ago
rycolab / aflt-f2023
View on GitHub
Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)
☆10Feb 21, 2023Updated 3 years ago
thjashin / rodeo
View on GitHub
Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)
☆18Nov 14, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
RakitinDen / pytorch-recursive-gumbel-max-trick
View on GitHub
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021
☆14Dec 11, 2021Updated 4 years ago
IBM / selective-dense-state-space-model
View on GitHub
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …
☆16Sep 18, 2025Updated 10 months ago
teffland / ner-expected-entity-ratio
View on GitHub
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14Nov 7, 2022Updated 3 years ago
ShannonAI / mrc-for-dependency-parsing
View on GitHub
☆18May 28, 2021Updated 5 years ago
priyankjaini / discFlowMH
View on GitHub
Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC
☆11Mar 1, 2021Updated 5 years ago
Yinghao-Li / CHMM-ALT
View on GitHub
Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"
☆32Jun 20, 2023Updated 3 years ago
FadedCosine / Dependency-Guided-Neural-Text-Generation
View on GitHub
Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…
☆26May 27, 2022Updated 4 years ago