mattian7 / CoT-Papers-NoteLinks
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
β41Updated last year
Alternatives and similar repositories for CoT-Papers-Note
Users that are interested in CoT-Papers-Note are comparing it to the libraries listed below
Sorting:
- Paper List for In-context Learning π·β183Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β123Updated 8 months ago
- A comprehensive collection of process reward models.β85Updated last week
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ164Updated last year
- β81Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?β80Updated last year
- Paper collections of retrieval-based (augmented) language model.β232Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"β¦β80Updated last year
- Feeling confused about super alignment? Here is a reading listβ42Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inferenceβ61Updated this week
- β126Updated 8 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β218Updated last week
- β57Updated this week
- The related works and background techniques about Openai o1β221Updated 4 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).β40Updated last year
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generβ¦β60Updated 10 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correctβ176Updated 4 months ago
- β141Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β121Updated 2 months ago
- β60Updated 2 weeks ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"β59Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questionsβ111Updated 8 months ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWWβ¦β127Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β54Updated 6 months ago
- β39Updated 9 months ago
- A curated list of personalized alignment resources (continually updated).β22Updated last week
- The official code repository for PRMBench.β73Updated 3 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".β120Updated 7 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)β139Updated 3 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluationβ50Updated 2 months ago