AlexMan2000 / UCB_EECS_127Links
☆10Updated last year
Alternatives and similar repositories for UCB_EECS_127
Users that are interested in UCB_EECS_127 are comparing it to the libraries listed below
Sorting:
- ICLR 2025 Agent-Related Papers☆71Updated 6 months ago
- ☆76Updated 9 months ago
- ☆58Updated 2 months ago
- ☆39Updated last week
- homework answer for UCB cs285 deepRL☆44Updated 5 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆35Updated this week
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆20Updated last year
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆39Updated 7 months ago
- Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory☆157Updated this week
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆89Updated this week
- Stanford convex optimization course☆87Updated 4 years ago
- Paper collections of the continuous effort start from World Models.☆172Updated 11 months ago
- one of my CSC courses in CUHK(SZ)☆18Updated 3 years ago
- ☆218Updated 5 months ago
- A comprehensive collection of process reward models.☆85Updated 2 weeks ago
- ☆13Updated this week
- A curated list of personalized alignment resources (continually updated).☆22Updated last week
- 带中文导读的PhD申请攻略收集☆64Updated last year
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆54Updated 10 months ago
- ☆19Updated last week
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- ✍️ The notes of courses in Shanghai Jiao Tong University☆171Updated 3 years ago
- SJTU2022Fall 电类工程导论(C类) Instructor: 张娅,何大治☆11Updated last year
- ☆38Updated 3 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆228Updated this week
- ☆33Updated last week
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆40Updated 2 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆212Updated this week
- 此项目用于归档本人在上海交通大学计算机科学与技术专业学习过程中的作业与项目。☆38Updated 10 months ago
- A Boost Based C++ HTTP JSON Server☆14Updated last year