reasoning-survey / Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
☆496Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Reasoning-Foundation-Models:
Users that are interested in Awesome-Reasoning-Foundation-Models are comparing it to the libraries listed below
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆521Updated 2 weeks ago
- Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.☆578Updated this week
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆329Updated 6 months ago
- ☆384Updated 3 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆485Updated 2 months ago
- A series of technical report on Slow Thinking with LLM☆297Updated last week
- An Awesome Collection for LLM Survey☆321Updated 4 months ago
- ☆432Updated 2 weeks ago
- [ACL 2023] Reasoning with Language Model Prompting: A Survey☆919Updated 3 weeks ago
- Recipes to train reward model for RLHF.☆1,084Updated last month
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,448Updated 3 weeks ago
- papers related to LLM-agent that published on top conferences☆309Updated 11 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆400Updated 2 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆388Updated this week
- O1 Replication Journey: A Strategic Progress Report – Part I☆1,861Updated this week
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆800Updated 2 months ago
- Paper List for In-context Learning 🌷☆827Updated 3 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆447Updated 9 months ago
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨☆474Updated 9 months ago
- The related works and background techniques about Openai o1☆192Updated last week
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)☆908Updated last month
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆382Updated 9 months ago
- Aligning Large Language Models with Human: A Survey☆709Updated last year
- ☆295Updated last month
- 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥☆1,166Updated this week
- Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …☆966Updated last month
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆178Updated this week
- Paper collection on building and evaluating language model agents via executable language grounding☆343Updated 8 months ago
- Large Reasoning Models☆787Updated last month
- RewardBench: the first evaluation tool for reward models.☆491Updated last week