reasoning-survey / Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
☆513Updated last month
Alternatives and similar repositories for Awesome-Reasoning-Foundation-Models:
Users who are interested in Awesome-Reasoning-Foundation-Models are comparing it to the repositories listed below
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆565Updated 3 weeks ago
- A series of technical reports on Slow Thinking with LLM☆393Updated this week
- Related work and background techniques for OpenAI o1☆208Updated last month
- An Awesome Collection for LLM Survey☆326Updated 5 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆407Updated 3 weeks ago
- ☆398Updated 4 months ago
- Papers related to LLM agents published at top conferences☆311Updated last year
- Paper list about multimodal and large language models, only used to record papers I read in the daily arXiv for personal needs.☆585Updated this week
- Large Reasoning Models☆802Updated 2 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆347Updated 3 weeks ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆495Updated 3 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆411Updated 3 months ago
- A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks☆255Updated 6 months ago
- Reading list on hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …☆977Updated 2 months ago
- Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"☆451Updated 10 months ago
- LLM hallucination paper list☆302Updated 11 months ago
- ☆468Updated last month
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆533Updated 2 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆798Updated this week
- Collection of training data management explorations for large language models☆307Updated 6 months ago
- Awesome LLM Plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimod, etc.☆185Updated this week
- ☆889Updated 6 months ago
- This is the repository for the Tool Learning survey.☆301Updated this week
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆814Updated 3 months ago
- Efficient Multimodal Large Language Models: A Survey☆312Updated 6 months ago
- O1 Replication Journey☆1,945Updated last month
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,005Updated last month
- 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥☆1,217Updated last week
- Official repository for ICLR 2025 paper "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient an…☆619Updated this week