AlexMan2000 / UCB_EECS_127Links
☆10Updated last year
Alternatives and similar repositories for UCB_EECS_127
Users that are interested in UCB_EECS_127 are comparing it to the libraries listed below
Sorting:
- ICLR 2025 Agent-Related Papers☆73Updated 11 months ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆21Updated last year
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆42Updated 2 weeks ago
- Stanford convex optimization course☆104Updated 4 years ago
- In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a…☆61Updated 7 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- ☆81Updated last year
- homework answer for UCB cs285 deepRL☆58Updated 10 months ago
- 带中文导读的PhD申请攻略收集☆78Updated 2 years ago
- [ICML 2025] Official Implementation of GLIDER☆64Updated 3 weeks ago
- I love C++.☆35Updated 10 months ago
- ☆25Updated 2 years ago
- one of my CSC courses in CUHK(SZ)☆19Updated 3 years ago
- ☆24Updated 3 weeks ago
- 清华大学飞跃手册☆389Updated last month
- ☆278Updated 10 months ago
- SJTU Canvas 视频 (批量) 下载器☆121Updated 5 months ago
- Course Archive for AI Major in SJTU☆85Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆326Updated this week
- All hail, Thy Highest University (THU)☆43Updated 6 years ago
- 中国科学技术大学大数据算法课程笔记2023☆31Updated 2 years ago
- ☆602Updated 3 years ago
- ☆24Updated 9 months ago
- 思源码消费年度总结☆154Updated 10 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models☆1,951Updated this week
- all the notes, ppts and homework for CS224n☆125Updated last year
- UCB 285 Deep Reinforcement Learning (Fall 2023) Homeworks☆13Updated last year
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆32Updated 2 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆350Updated 3 weeks ago
- ☆192Updated 3 months ago