xianminx / mooc-cs294-llm-agents
CS294/194-196 Large Language Model Agents
☆38Updated last year
Alternatives and similar repositories for mooc-cs294-llm-agents
Users who are interested in mooc-cs294-llm-agents are comparing it to the libraries listed below
- Notes and commented code for RLHF (PPO)☆120Updated last year
- ☆403Updated 11 months ago
- ☆89Updated 5 months ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆69Updated 8 months ago
- ☆99Updated last year
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆165Updated last month
- ☆465Updated 3 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆138Updated 7 months ago
- Repository for Zochi's Research☆295Updated last month
- 🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"☆103Updated 8 months ago
- ☆204Updated 4 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆185Updated this week
- A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…☆306Updated 9 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆139Updated last year
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆123Updated last month
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆178Updated 5 months ago
- ☆68Updated last year
- ☆249Updated 4 months ago
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆69Updated 4 months ago
- A benchmark list for evaluation of large language models.☆152Updated 3 months ago
- A brief and partial summary of RLHF algorithms.☆139Updated 9 months ago
- minimal GRPO implementation from scratch☆100Updated 9 months ago
- ☆77Updated 7 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆171Updated 3 months ago
- ☆202Updated 5 months ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆210Updated last month
- ☆168Updated 2 months ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆140Updated 4 months ago
- A RL Framework for multi LLM agent system☆83Updated this week
- Distributed training (multi-node) of a Transformer model☆90Updated last year