AlexMan2000 / UCB_EECS_127Links
☆10Updated last year
Alternatives and similar repositories for UCB_EECS_127
Users that are interested in UCB_EECS_127 are comparing it to the libraries listed below
Sorting:
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆20Updated last year
- ICLR 2025 Agent-Related Papers☆73Updated 10 months ago
- I love C++.☆35Updated 9 months ago
- ☆81Updated last year
- In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a…☆61Updated 6 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆41Updated 4 months ago
- one of my CSC courses in CUHK(SZ)☆19Updated 2 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆49Updated 6 months ago
- 2020北京大学数据结构与算法(A)部分作业Archive☆11Updated 4 years ago
- UCB 285 Deep Reinforcement Learning (Fall 2023) Homeworks☆13Updated last year
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆40Updated 11 months ago
- one of my CSC courses in CUHK(SZ)☆19Updated 3 years ago
- ☆110Updated 3 weeks ago
- ☆16Updated last year
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Updated 7 months ago
- homework answer for UCB cs285 deepRL☆54Updated 9 months ago
- ☆21Updated 2 months ago
- Stanford convex optimization course☆101Updated 4 years ago
- I love algorithms.☆25Updated 9 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆493Updated last year
- ☆267Updated 10 months ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆32Updated last month
- all the notes, ppts and homework for CS224n☆121Updated last year
- ☆45Updated last month
- Awesome RL-based LLM Reasoning☆636Updated 2 months ago
- ☆22Updated this week
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆289Updated this week
- Paper list for Efficient Reasoning.☆679Updated 3 weeks ago
- Course Archive for AI Major in SJTU☆81Updated last year
- 交大自动化-人工智能资料汇总(持续更新)☆41Updated 4 years ago