Rishit-dagli / Compositional-AttentionLinks

An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA

☆14

Alternatives and similar repositories for Compositional-Attention

Users that are interested in Compositional-Attention are comparing it to the libraries listed below

Sorting:

facebookresearch / dmae_st
Directed masked autoencoders
☆14Updated 2 years ago
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆26Updated 3 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆49Updated 3 years ago
data2ml / all-clip
Load any clip model with a standardized interface
☆21Updated last year
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24Updated 2 weeks ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
ChristophReich1996 / HyperMixer
PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].
☆17Updated 3 years ago
simran-arora / focus
This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?"
☆24Updated 2 years ago
ekinakyurek / google-research
Google Research
☆46Updated 2 years ago
lucidrains / holodeck-pytorch
Implementation of a holodeck, written in Pytorch
☆18Updated last year
allenai / smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆33Updated last year
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 6 months ago
jxbz / entropix
📰 Computing the information content of trained neural networks
☆21Updated 3 years ago
EleutherAI / magiCARP
One stop shop for all things carp
☆59Updated 2 years ago
salesforce / adversarial-polyglots
Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)
☆10Updated 3 months ago
lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Updated 4 years ago
facebookresearch / tce
Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.
☆13Updated last year
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆35Updated 2 years ago
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Updated last year
AranKomat / Metroplex
☆21Updated 2 years ago
lucidrains / multistream-transformers
Implementation of Multistream Transformers in Pytorch
☆54Updated 4 years ago
sayakpaul / BiT-jax2tf
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14Updated 3 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
Avmb / inverse_scaling_prize_code_identifier_swap
Submission to the inverse scaling prize
☆23Updated 2 years ago
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated 8 months ago
sayakpaul / MLPMixer-jax2tf
This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.
☆15Updated 3 years ago
tchaton / pytorch2lightning
☆15Updated 4 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago