zhaoxlpku / DASC7606-A3Links
☆13Updated last year
Alternatives and similar repositories for DASC7606-A3
Users that are interested in DASC7606-A3 are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- ☆15Updated last year
- Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)☆1,152Updated 3 weeks ago
- everything about llm & aigc☆109Updated last week
- A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.☆738Updated 11 months ago
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆110Updated last year
- ☆79Updated 2 months ago
- A Survey on Large Language Model-Based Game Agents☆772Updated last month
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,025Updated 3 weeks ago
- ☆466Updated 4 months ago
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆159Updated last year
- ☆1,292Updated 3 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,147Updated last month
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,272Updated this week
- 这是中国人民大学高瓴人工智能学院本科课程《强化学习》的期末项目安排,项目内容是训练一个适用于国标麻将的强化学习智能体。☆20Updated last year
- [Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems☆1,408Updated 2 months ago
- ☆440Updated 4 months ago
- 本仓库提供了一个基于PyTorch实现的Transformer模型示例代码,专为初学者设计,用以深入浅出地讲解Transformer架构的工作原理和应用。通过阅读和运行此项目中的代码,学习者可以快速理解自注意力机制、编码器-解码器结构以及如何在实际任务中使用Transfor…☆68Updated last year
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit…☆1,222Updated 9 months ago
- 历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.☆519Updated 9 months ago
- Survey on LLM Agents (Published on CoLing 2025)☆442Updated 2 months ago
- This is the repository for the Tool Learning survey.☆461Updated 4 months ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆448Updated 11 months ago
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆1,011Updated 3 months ago
- Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个☆1,207Updated last year
- My implementation of Stanford CS336 assignments.☆208Updated 5 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆665Updated 3 months ago
- Paper list for Personal LLM Agents☆421Updated last year
- O1 Replication Journey☆2,001Updated 11 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,416Updated 8 months ago