AlexMan2000 / UCB_EECS_127
☆10Updated last year
Alternatives and similar repositories for UCB_EECS_127
Users that are interested in UCB_EECS_127 are comparing it to the libraries listed below
- ICLR 2025 Agent-Related Papers☆72Updated last year
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆21Updated last year
- A collection of PhD application guides with Chinese-language introductions☆79Updated 2 years ago
- I love C++.☆34Updated 11 months ago
- In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a…☆61Updated 8 months ago
- all the notes, ppts and homework for CS224n☆128Updated last year
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆387Updated 4 months ago
- ☆117Updated last week
- [ICML 2025] Official Implementation of GLIDER☆67Updated last month
- ☆81Updated last year
- I love algorithms.☆25Updated 11 months ago
- ☆45Updated last month
- NJUAI-Master-Courses☆28Updated 2 years ago
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆39Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆373Updated last month
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- ☆24Updated 10 months ago
- ☆28Updated 5 years ago
- homework answer for UCB cs285 deepRL☆60Updated 11 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Updated 9 months ago
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆63Updated 6 months ago
- University of Chinese Academy of Sciences, Spring 2022-2023 semester, Natural Language Processing course☆27Updated 2 years ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆348Updated last week
- ☆56Updated 5 months ago
- An AI teaching assistant system based on the content of the textbook "Mathematics in AI" (《AI 中的数学》)☆18Updated 6 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆47Updated last week
- The newest solution for CS224n: Stanford NLP (homework code implementation)☆72Updated 2 years ago
- University of Science and Technology of China, Big Data Algorithms course notes, 2023☆31Updated 2 years ago
- ☆289Updated 11 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆373Updated last month