Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆52Jun 11, 2025Updated last year
Alternatives and similar repositories for recurrent-fwp
Users who are interested in recurrent-fwp are comparing it to the libraries listed below.
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep R…☆177Jun 11, 2025Updated last year
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆28Aug 19, 2023Updated 2 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆115Jun 10, 2021Updated 5 years ago
- PyTorch Language Modeling Toolkit for Fast Weight Programmers☆22Jun 11, 2025Updated last year
- This bot crawls and downloads statistics and pictures from google scholar's researchers.☆21Apr 27, 2023Updated 3 years ago
- This is the dataset generation code for ADEPT (Approximate Derenderer, Extended Physics, and Tracking). http://physadept.csail.mit.edu/☆15Sep 26, 2022Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Feb 27, 2023Updated 3 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆30Feb 25, 2021Updated 5 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆25Jun 6, 2024Updated 2 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆30Sep 25, 2021Updated 4 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated last year
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆23Dec 7, 2024Updated last year
- [ICLR'20] [PyTorch] Inverted Attention Routing for Capsules☆30Feb 26, 2020Updated 6 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆788Dec 16, 2023Updated 2 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆15May 10, 2024Updated 2 years ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆68Apr 24, 2024Updated 2 years ago
- S. M. Ali Eslami et al. Attend, Infer, Repeat: Fast Scene Understanding with Generative Models ICML16☆14Sep 27, 2018Updated 7 years ago
- Implementation of the user-space eBPF VM based on the iovisor version (https://github.com/iovisor/ubpf)☆13Apr 16, 2020Updated 6 years ago
- Implementation of iterative inference in deep latent variable models