xihuai18 / arxiv-sanity-x
☆16Updated last month
Related projects: ⓘ
- ☆40Updated 10 months ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆25Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆32Updated 5 months ago
- This repository is a collection of research papers on World Models.☆28Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated 6 months ago
- ☆102Updated 2 months ago
- Paper collections of the continuous effort start from World Models.☆127Updated 2 months ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).☆68Updated 5 months ago
- AI Alignment: A Comprehensive Survey☆123Updated 10 months ago
- ICLR 2024 OpenReivew Submission Data☆131Updated 10 months ago
- ☆11Updated 11 months ago
- ☆52Updated 8 months ago
- ☆32Updated last year
- ☆28Updated 5 months ago
- ICLR2024 statistics☆45Updated 9 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆89Updated 2 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆63Updated last year
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆69Updated last month
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆82Updated last year
- ☆22Updated 10 months ago
- ☆11Updated last week
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆12Updated 6 months ago
- An RL-Friendly Vision-Language Model for Minecraft☆24Updated last month
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆25Updated 5 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆86Updated 3 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆85Updated last week
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆38Updated last month
- This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Co…☆66Updated 2 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆85Updated this week