Gaiejj / omniairl
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆11Updated last year
Related projects:
- A collection of ICLR 2024 papers and open-source projects☆78Updated 4 months ago
- This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Co…☆66Updated 2 months ago
- ☆47Updated last week
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Updated 7 months ago
- ☆16Updated 5 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆24Updated last month
- ☆16Updated 2 years ago
- Survey on Data-centric Large Language Models☆58Updated 2 months ago
- An index of algorithms for reinforcement learning from human feedback (RLHF)☆81Updated 5 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆176Updated last week
- [ACL'2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆44Updated last month
- ICLR2024 statistics☆45Updated 9 months ago
- 🎉🎨 This repository contains a reading list of papers with code on **Meta-Learning** and **Meta-Reinforcement-Learning**☆30Updated 6 months ago
- ICLR 2024 OpenReview Submission Data☆131Updated 10 months ago
- A paper collection tracking the continuing line of work that started from World Models.☆127Updated 2 months ago
- ☆119Updated last week
- [CVPR2024] This is the official implementation of MP5☆72Updated 2 months ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆12Updated 8 months ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆37Updated 10 months ago
- A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.☆12Updated 2 months ago
- Using clash on Linux without sudo privileges☆23Updated 4 months ago
- Some experiences to help new researchers grow☆33Updated last year
- ☆32Updated last month
- Documents used for grad school application☆280Updated 3 years ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆16Updated 6 months ago
- Reviews of some AI courses☆22Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆26Updated 4 months ago
- ☆53Updated 2 months ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆27Updated last week
- ☆30Updated last week