AlexMan2000 / UCB_EECS_127
☆11 Updated 2 years ago
Alternatives and similar repositories for UCB_EECS_127
Users that are interested in UCB_EECS_127 are comparing it to the libraries listed below
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System ☆21 Updated 2 years ago
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024 ☆40 Updated last year
- ☆316 Updated last year
- A collection of PhD application guides with Chinese-language introductions ☆84 Updated 2 years ago
- ICLR 2025 Agent-Related Papers ☆75 Updated last year
- ☆83 Updated last year
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab… ☆52 Updated 2 months ago
- all the notes, ppts and homework for CS224n ☆137 Updated last year
- UCB 285 Deep Reinforcement Learning (Fall 2023) Homeworks ☆13 Updated 2 years ago
- ☆125 Updated last month
- homework answer for UCB cs285 deepRL ☆69 Updated last year
- All hail, Thy Highest University (THU) ☆46 Updated 7 years ago
- I love C++. ☆38 Updated last year
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. ☆419 Updated 7 months ago
- In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a… ☆61 Updated 10 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de… ☆63 Updated last year
- ☆24 Updated 4 months ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models ☆51 Updated 10 months ago
- Paper list for Efficient Reasoning. ☆822 Updated last week
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation" ☆68 Updated last month
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆418 Updated 2 months ago
- Using clash on Linux without sudo privileges ☆173 Updated last year
- SJTU Canvas video (batch) downloader ☆131 Updated 8 months ago
- The newest solution for CS224n: Stanford NLP. (Homework code implementations) ☆74 Updated 2 years ago
- [ICML 2025] Official Implementation of GLIDER ☆72 Updated 4 months ago
- What if you need more exercises? ☆31 Updated last year
- PPO in one file ☆27 Updated last year
- ☆28 Updated 5 years ago
- Course Archive for AI Major in SJTU ☆88 Updated last year
- Introduction to Computation ☆84 Updated last year