mattian7 / CoT-Papers-Note
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
β41Updated last year
Alternatives and similar repositories for CoT-Papers-Note
Users that are interested in CoT-Papers-Note are comparing it to the libraries listed below
Sorting:
- Paper List for In-context Learning π·β183Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β116Updated 7 months ago
- A comprehensive collection of process reward models.β76Updated this week
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ162Updated last year
- β81Updated last year
- β117Updated 8 months ago
- β55Updated 7 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"β113Updated last week
- [SIGIR'24] The official implementation code of MOELoRA.β162Updated 9 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise deβ¦β53Updated 10 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"β¦β80Updated last year
- An Arena-style Automated Evaluation Benchmark for Detailed Captioningβ31Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β208Updated 2 weeks ago
- [Preprint] A Neural-Symbolic Self-Training Frameworkβ107Updated last month
- [ICML'2024] Can AI Assistants Know What They Don't Know?β81Updated last year
- A Survey on the Honesty of Large Language Modelsβ57Updated 5 months ago
- β74Updated 11 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.β61Updated 3 months ago
- β41Updated 3 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β52Updated 5 months ago
- A paper list about diffusion models for natural language processing.β182Updated last year
- Reference implementation for Token-level Direct Preference Optimization(TDPO)β138Updated 3 months ago
- A research repo for experiments about Reinforcement Finetuningβ46Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β119Updated last month
- β45Updated 6 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generβ¦β60Updated 10 months ago
- β153Updated last month
- A curated list of personalized alignment resources (continually updated).β16Updated 3 weeks ago
- β140Updated last year
- Paper collections of retrieval-based (augmented) language model.β232Updated 11 months ago