ytyz1307zzh / RefAugLinks
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆55Updated last year
Alternatives and similar repositories for RefAug
Users that are interested in RefAug are comparing it to the libraries listed below
Sorting:
- ☆127Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆148Updated 11 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆98Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆143Updated 10 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆153Updated last year
- ☆136Updated 10 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆80Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆155Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge"☆36Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆202Updated 3 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆116Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆82Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆59Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆59Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆123Updated 10 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆106Updated 2 months ago
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 7 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆172Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F …☆68Updated last year
- ☆74Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆121Updated last year
- Replicating O1 inference-time scaling laws☆90Updated 10 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆116Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment