LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning
☆117Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Long-Chain-of-Thought-Reasoning:
Users that are interested in Awesome-Long-Chain-of-Thought-Reasoning are comparing it to the libraries listed below
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆153Updated this week
- A Survey on Efficient Reasoning for LLMs☆116Updated this week
- ☆54Updated 5 months ago
- A Comprehensive Survey on Long Context Language Modeling☆86Updated last week
- The demo, code and data of FollowRAG☆70Updated 3 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 3 months ago
- ☆105Updated 6 months ago
- ☆186Updated this week
- The code and data of DPA-RAG☆58Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆104Updated last week
- ☆41Updated last week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆148Updated last week
- SOTA RL fine-tuning solution for advanced math reasoning of LLM☆91Updated this week
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆124Updated 3 months ago
- ☆34Updated 3 weeks ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆73Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆110Updated 6 months ago
- The official code repository for PRMBench.☆68Updated last month
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆116Updated 4 months ago
- ☆166Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆65Updated this week
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆23Updated 2 months ago
- ☆64Updated 9 months ago
- A research repo for experiments about Reinforcement Finetuning☆36Updated last week
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆112Updated last month
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆64Updated last month
- ☆83Updated 2 weeks ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆37Updated 3 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆107Updated 6 months ago
- ☆113Updated 2 months ago