LeapLabTHU / FamO2OLinks
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆37Updated last year
Alternatives and similar repositories for FamO2O
Users that are interested in FamO2O are comparing it to the libraries listed below
Sorting:
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆22Updated 2 years ago
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆23Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆67Updated 11 months ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆81Updated 3 months ago
- ☆13Updated 8 months ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆12Updated 10 months ago
- Implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆25Updated 2 weeks ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆39Updated 9 months ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆22Updated 2 months ago
- ☆13Updated last year
- ☆16Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆39Updated last year
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934☆79Updated 2 months ago
- ☆60Updated last year
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆13Updated 2 weeks ago
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆34Updated 11 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆104Updated 6 months ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆45Updated 10 months ago
- [NeurIPS 2023] Efficient Diffusion Policy☆108Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆141Updated 3 months ago
- An RL-Friendly Vision-Language Model for Minecraft☆36Updated 10 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆94Updated 2 months ago
- ☆33Updated 2 years ago
- [ICLR 2025 Spotlight] Official PyTorch Implementation of "What Makes a Good Diffusion Planner for Decision Making?"☆71Updated 4 months ago
- ☆18Updated last year
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆28Updated 2 years ago
- ☆17Updated 5 months ago
- Code for Stable Control Representations☆25Updated 4 months ago