bergen / EdgeTransformer
☆22Updated 3 years ago
Alternatives and similar repositories for EdgeTransformer:
Users who are interested in EdgeTransformer are comparing it to the libraries listed below.
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆11Updated 9 months ago
- ☆32Updated 3 years ago
- Code to reproduce the results for Compositional Attention☆60Updated 2 years ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆26Updated 3 years ago
- ☆18Updated 6 months ago
- ☆31Updated 11 months ago
- Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization☆14Updated 2 years ago
- ☆39Updated 2 years ago
- ☆49Updated last year
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆63Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 3 years ago
- Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆24Updated last month
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated this week
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆12Updated last year
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆23Updated 3 years ago
- lanmt ebm☆11Updated 4 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Updated 4 years ago
- ☆18Updated 2 months ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆26Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 8 months ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆70Updated last year
- ☆50Updated 3 years ago
- Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder…☆17Updated 4 years ago
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆11Updated 3 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- ☆25Updated 9 months ago
- ☆28Updated last year
- Stick-breaking attention☆37Updated this week
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆11Updated 3 months ago