mattian7 / CoT-Papers-NoteLinks
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
β41Updated 2 years ago
Alternatives and similar repositories for CoT-Papers-Note
Users that are interested in CoT-Papers-Note are comparing it to the libraries listed below
Sorting:
- Paper List for In-context Learning π·β183Updated last year
- A comprehensive collection of process reward models.β96Updated 2 weeks ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ165Updated last year
- β252Updated last month
- The related works and background techniques about Openai o1β224Updated 6 months ago
- [SIGIR'24] The official implementation code of MOELoRA.β174Updated last year
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Futureβ457Updated 6 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β129Updated 10 months ago
- A method of ensemble learning for heterogeneous large language models.β58Updated last year
- β52Updated last month
- [ACL 2025] A Neural-Symbolic Self-Training Frameworkβ109Updated 2 months ago
- A curated list of personalized alignment resources (continually updated).β34Updated 2 weeks ago
- β151Updated 10 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).β40Updated last year
- A research repo for experiments about Reinforcement Finetuningβ49Updated 4 months ago
- Paper collections of retrieval-based (augmented) language model.β232Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β56Updated 8 months ago
- The awesome agents in the era of large language modelsβ67Updated last year
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by gβ¦β35Updated 3 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β125Updated 4 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"β¦β79Updated last year
- β102Updated 2 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.β319Updated last year
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Modelsβ37Updated last year
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β241Updated 2 months ago
- β47Updated 5 months ago
- β67Updated last month
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".β125Updated 9 months ago
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyondβ277Updated last month
- β48Updated 9 months ago