alfassy / FETA
☆21Updated 6 months ago
Related projects: ⓘ
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆15Updated 10 months ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆32Updated last year
- ☆26Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- ☆20Updated 5 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆11Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆35Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆16Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆50Updated last year
- ☆32Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆33Updated 3 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆19Updated 2 years ago
- ☆22Updated 2 months ago
- Official code for the paper: "Metadata Archaeology"☆18Updated last year
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation☆33Updated 11 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- Code to reproduce the results for Compositional Attention☆60Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆30Updated last year
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆14Updated last year
- codebase for the SIMAT dataset and evaluation