A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
☆37 · Feb 7, 2023 · Updated 3 years ago
Alternatives and similar repositories for transformer-xl
Users interested in transformer-xl are comparing it to the libraries listed below.
- Rust bindings for CTranslate2 ☆14 · Jun 21, 2023 · Updated 2 years ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025 ☆17 · Nov 24, 2025 · Updated 3 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R… ☆12 · Feb 23, 2024 · Updated 2 years ago
- Pointax: PointMaze Environment for JAX ☆26 · Oct 22, 2025 · Updated 4 months ago
- [ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia… ☆17 · Jun 12, 2025 · Updated 8 months ago
- ☆16 · Jul 16, 2024 · Updated last year
- Measuring the Signal to Noise Ratio in Language Model Evaluation ☆28 · Aug 19, 2025 · Updated 6 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024) ☆26 · Jan 14, 2025 · Updated last year
- ☆19 · May 20, 2025 · Updated 9 months ago
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024 ☆20 · Oct 21, 2025 · Updated 4 months ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025 ☆33 · Jul 8, 2025 · Updated 7 months ago
- ☆19 · Apr 22, 2024 · Updated last year
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies" ☆25 · May 5, 2024 · Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024) ☆105 · Sep 29, 2025 · Updated 5 months ago
- minimal Energy-based transformer ☆43 · Dec 11, 2025 · Updated 2 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Oct 12, 2023 · Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 · Apr 19, 2024 · Updated last year
- Multi-agent simulator in Jax for research and teaching in AI & ALife ☆31 · Updated this week
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention. ☆31 · Nov 4, 2024 · Updated last year
- ☆24 · Aug 9, 2024 · Updated last year
- Reinforcement Learning inside a 3D soccer simulation ☆37 · Sep 15, 2024 · Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation" ☆34 · Sep 18, 2024 · Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆205 · Jun 18, 2024 · Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆28 · Mar 24, 2023 · Updated 2 years ago
- Code for "Baba Is AI: Break the Rules to Beat the Benchmark" ☆41 · Sep 3, 2025 · Updated 6 months ago
- Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlig… ☆34 · Jan 16, 2025 · Updated last year
- ☆31 · Jun 21, 2024 · Updated last year
- Minimal but scalable implementation of large language models in JAX ☆35 · Nov 28, 2025 · Updated 3 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆36 · Oct 31, 2024 · Updated last year
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP). ☆10 · Mar 5, 2022 · Updated 4 years ago
- ☆92 · Feb 16, 2026 · Updated 2 weeks ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022) ☆38 · Jun 3, 2023 · Updated 2 years ago
- Workshop materials for scraping Twitter with Python ☆13 · May 25, 2016 · Updated 9 years ago
- ☆11 · Nov 10, 2020 · Updated 5 years ago
- Implementation of data dimensionality reduction algorithms SVD and CUR without using library functions. ☆10 · Jul 24, 2017 · Updated 8 years ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies" ☆23 · Dec 12, 2025 · Updated 2 months ago
- ☆11 · Aug 4, 2022 · Updated 3 years ago
- Multi Stopwatch for Python ☆12 · Sep 28, 2019 · Updated 6 years ago
- Supporting material for Princeton ORF307 ☆12 · Jan 14, 2026 · Updated last month