atfortes / Awesome-LLM-Reasoning
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 π
β3,031Updated this week
Alternatives and similar repositories for Awesome-LLM-Reasoning:
Users that are interested in Awesome-LLM-Reasoning are comparing it to the libraries listed below
- π° Must-read papers and blogs on LLM based Long Context Modeling π₯β1,455Updated 3 weeks ago
- Must-read Papers on LLM Agents.β2,345Updated 2 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought promptingβ2,723Updated 9 months ago
- A library for advanced large language model reasoningβ2,116Updated 3 weeks ago
- O1 Replication Journeyβ1,986Updated 3 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)β6,595Updated this week
- Reference implementation for DPO (Direct Preference Optimization)β2,560Updated 8 months ago
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...β1,988Updated last week
- A repo lists papers related to LLM based agentβ1,607Updated 3 weeks ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.β2,265Updated this week
- [ACL 2023] Reasoning with Language Model Prompting: A Surveyβ952Updated last month
- Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Modelsβ1,193Updated 2 months ago
- A quick guide (especially) for trending instruction finetuning datasetsβ3,045Updated last year
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Modelsβ1,762Updated 3 months ago
- Must-read Papers on Knowledge Editing for Large Language Models.β1,076Updated 2 months ago
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".β2,035Updated last year
- AllenAI's post-training codebaseβ2,942Updated this week
- A curated list for Efficient Large Language Modelsβ1,644Updated 2 weeks ago
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitβ¦β1,014Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,671Updated last week
- A reading list on LLM based Synthetic Data Generation π₯β1,259Updated 2 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.β1,736Updated 4 months ago
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 π and reasoning techniques.β6,715Updated this week
- β2,783Updated 2 months ago
- β¨β¨Latest Papers and Benchmarks in Reasoning with Foundation Modelsβ571Updated 2 weeks ago
- [TMLR 2024] Efficient Large Language Models: A Surveyβ1,145Updated last month
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.β758Updated last year
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".β1,519Updated last month
- Paper List for In-context Learning π·β854Updated 7 months ago
- Official Repo for Open-Reasoner-Zeroβ1,904Updated last month