mattian7 / CoT-Papers-NoteLinks
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
☆41Updated 2 years ago
Alternatives and similar repositories for CoT-Papers-Note
Users that are interested in CoT-Papers-Note are comparing it to the libraries listed below
Sorting:
- A comprehensive collection of process reward models.☆92Updated 2 weeks ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆125Updated 9 months ago
- Paper List for In-context Learning 🌷☆183Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆164Updated last year
- Paper collections of retrieval-based (augmented) language model.☆232Updated last year
- ☆64Updated 3 weeks ago
- ☆222Updated this week
- Official repository for "PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning"☆32Updated this week
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 5 months ago
- ☆62Updated last week
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Updated 11 months ago
- ☆133Updated 9 months ago
- ☆41Updated 10 months ago
- ☆37Updated this week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆81Updated last year
- A curated list of personalized alignment resources (continually updated).☆22Updated this week
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆35Updated this week
- ☆44Updated 4 months ago
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation☆51Updated 6 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆50Updated 2 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆168Updated 11 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆228Updated 2 weeks ago
- ☆60Updated 5 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆81Updated last year
- ☆82Updated last year
- The related works and background techniques about Openai o1☆222Updated 5 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆125Updated last week
- A research repo for experiments about Reinforcement Finetuning☆48Updated 2 months ago