LeapLabTHU / FamO2O
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆37 Updated last year
Related projects
Alternatives and complementary repositories for FamO2O
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning ☆21 Updated 2 years ago
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL ☆21 Updated last year
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆22 Updated last week
- [NeurIPS 2023] Efficient Diffusion Policy ☆82 Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar… ☆55 Updated last month
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆30 Updated this week
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL). ☆71 Updated 7 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆31 Updated 7 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆32 Updated last year
- [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning" ☆68 Updated last month
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023) ☆39 Updated last year
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition ☆41 Updated 6 months ago
- [ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks ☆32 Updated last year
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆40 Updated 9 months ago
- Official code repository for Prompt-DT. ☆98 Updated 2 years ago
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation ☆31 Updated 2 months ago
- Instruction Following Agents with Multimodal Transformers ☆51 Updated 2 years ago
- Implementation of CtrlFormer ☆28 Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆29 Updated 3 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents ☆32 Updated 3 weeks ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 Updated last month
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23) ☆26 Updated last year
- Masked World Models for Visual Control ☆118 Updated last year
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks ☆24 Updated last year
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies) ☆76 Updated 3 months ago