AlexMan2000 / UCB_EECS_127
☆10Updated last year
Alternatives and similar repositories for UCB_EECS_127:
Users that are interested in UCB_EECS_127 are comparing it to the libraries listed below
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆20Updated last year
- ICLR 2025 Agent-Related Papers☆67Updated 5 months ago
- SJTU2022Fall 电类工程导论(C类) Instructor: 张娅,何大治☆11Updated last year
- ☆212Updated 5 months ago
- UCB 285 Deep Reinforcement Learning (Fall 2023) Homeworks☆13Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆39Updated last month
- 带中文导读的PhD申请攻略收集☆62Updated last year
- A survey of Preference Reinforcement Learning☆9Updated last year
- homework answer for UCB cs285 deepRL☆42Updated 4 months ago
- ☆37Updated this week
- ☆76Updated 8 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆34Updated last month
- ☆57Updated last month
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆53Updated 9 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆35Updated last year
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆39Updated 6 months ago
- A comprehensive collection of process reward models.☆76Updated this week
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆159Updated 4 months ago
- Overseas Summer Research Guidance 海外暑研申请指南☆286Updated 6 months ago
- ☆78Updated 8 months ago
- NJUAI-Master-Courses☆25Updated last year
- ☆19Updated this week
- ☆26Updated last month
- 南京大学人工智能学院本科生开放日面试经验分享☆28Updated 2 years ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- ☆11Updated last month
- LLM multi-agent discussion framework for multi-agent/robot situations.☆34Updated 7 months ago
- papers related to Direct Preference Optimization(DPO)☆18Updated 9 months ago
- ✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models☆74Updated last month
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆10Updated 2 months ago