TianshuoY / HKU-DASC7606-A1Links
☆25Updated 9 months ago
Alternatives and similar repositories for HKU-DASC7606-A1
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
Sorting:
- ☆15Updated 8 months ago
- ☆13Updated 7 months ago
- ICLR 2025 Agent-Related Papers☆70Updated 7 months ago
- Awesome RL-based LLM Reasoning☆526Updated last month
- 《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Of…☆249Updated 3 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆390Updated last month
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆573Updated last month
- MMGraphRAG is a multi-modal knowledge graph-based framework designed to enhance complex reasoning tasks, such as multi-modal document que…☆13Updated 3 weeks ago
- ☆242Updated last month
- ☆70Updated last month
- Paper List of Inference/Test Time Scaling/Computing☆264Updated last week
- ☆77Updated 10 months ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆32Updated 2 months ago
- Awesome RL Reasoning Recipes ("Triple R")☆706Updated last week
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆48Updated last month
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆922Updated last week
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆125Updated last week
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆399Updated 5 months ago
- A Survey on Multimodal Retrieval-Augmented Generation☆231Updated 3 weeks ago
- ☆541Updated 5 months ago
- Demo code for the paper "Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up."☆4Updated 3 weeks ago
- Awesome Agent Training☆164Updated this week
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆49Updated last month
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆295Updated 6 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆188Updated 2 months ago
- ☆337Updated 4 months ago
- Survey on LLM Agents (Published on CoLing 2025)☆319Updated last month
- ☆242Updated 3 weeks ago
- This is the repository for the Tool Learning survey.☆395Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆271Updated last week