LeapLabTHU / FamO2O
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆37Updated last year
Related projects ⓘ
Alternatives and complementary repositories for FamO2O
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆21Updated last year
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆21Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy☆81Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆54Updated last month
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated last year
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).☆71Updated 7 months ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆40Updated 6 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated 7 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆31Updated 7 months ago
- Official code repository for Prompt-DT.☆96Updated 2 years ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆32Updated last week
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆67Updated last month
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆26Updated last year
- [ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks☆32Updated last year
- Instruction Following Agents with Multimodal Transforemrs☆50Updated 2 years ago
- ☆51Updated 8 months ago
- Implantation of CtrlFormer☆28Updated 2 years ago
- ☆33Updated last year
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆31Updated last month
- ☆16Updated last week
- An RL-Friendly Vision-Language Model for Minecraft☆25Updated 3 weeks ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆36Updated 8 months ago
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆24Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆51Updated last month
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆26Updated 4 months ago
- [NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning the…☆58Updated 3 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆74Updated 3 months ago
- ☆16Updated 6 months ago
- Masked World Models for Visual Control☆118Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 3 months ago