Rishit-dagli / Compositional-Attention
An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA
☆14 Updated 2 years ago
Alternatives and similar repositories for Compositional-Attention:
Users who are interested in Compositional-Attention are comparing it to the libraries listed below.
- Directed masked autoencoders ☆14 Updated 2 years ago
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 Updated 3 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 Updated 2 years ago
- Implementation of a holodeck, written in PyTorch ☆17 Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated this week
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers. ☆11 Updated 4 years ago
- Describe the format of image/text datasets ☆11 Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆15 Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- ☆17 Updated last week
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP. ☆15 Updated 3 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 Updated last year
- Local Attention - Flax module for Jax ☆20 Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations ☆14 Updated 2 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration. ☆13 Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆48 Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention network ☆34 Updated 2 years ago
- Code for the paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated last year
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow. ☆15 Updated 3 years ago
- Kaggle fashion dataset in dalle format ☆13 Updated 3 years ago
- ☆15 Updated 2 years ago
- Implementation for NATv2. ☆23 Updated 4 years ago
- Machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast ☆16 Updated 3 years ago
- ☆9 Updated last year
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- Experiments on GPT-3's ability to fit numerical models in-context. ☆14 Updated 2 years ago
- Visual search interface ☆11 Updated 3 years ago
- Unofficial TensorFlow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxi… ☆13 Updated 3 years ago