mohmdelsayed / weight-clippingLinks
☆18Updated 4 months ago
Alternatives and similar repositories for weight-clipping
Users that are interested in weight-clipping are comparing it to the libraries listed below
Sorting:
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆82Updated 2 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆104Updated 3 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆82Updated last year
- ☆114Updated 10 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆130Updated 6 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- ☆120Updated last month
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆80Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆91Updated last year
- ☆31Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆116Updated last year
- Repo for Implicit Diffusion Q-Learning☆121Updated 2 years ago
- Transformer-based World Models☆87Updated 2 years ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow☆41Updated last year
- Official implementation of the BRO algorithm☆53Updated 11 months ago
- Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL☆45Updated last year
- Clean single-file implementation of offline RL algorithms in JAX☆165Updated last month
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆46Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆22Updated 11 months ago
- Skeleton for scalable and flexible Jax RL implementations☆93Updated 2 years ago
- off-policy RL on long sequences☆155Updated last week
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆28Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆61Updated 2 years ago
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)☆43Updated 10 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆179Updated 5 months ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 5 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Updated last year
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆15Updated last year
- Meta-RL Model-Based Algorithm☆41Updated 8 months ago
- ☆47Updated 3 months ago