ytyz1307zzh / RefAugView external linksLinks
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆54Oct 1, 2024Updated last year
Alternatives and similar repositories for RefAug
Users that are interested in RefAug are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆87Mar 23, 2025Updated 10 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆24Jul 22, 2024Updated last year
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated 11 months ago
- ☆16Sep 4, 2025Updated 5 months ago
- Joint Multi-label Attention Network (JMAN)☆12Sep 17, 2020Updated 5 years ago
- ☆16Mar 1, 2025Updated 11 months ago
- Reproducible and flexible LLM evaluations for scientific reasoning.☆26Jul 23, 2025Updated 6 months ago
- ☆30Dec 27, 2024Updated last year
- Code for Quiet-STaR☆740Aug 21, 2024Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- Implementation of the BitLinear layer from: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits☆13Sep 11, 2024Updated last year
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 8 months ago
- Official code for infimm-hd☆16Sep 4, 2024Updated last year
- Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"☆15Apr 24, 2023Updated 2 years ago
- The code implementation of Symbolic-MoE☆46Sep 2, 2025Updated 5 months ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 2 years ago
- Official implementation of TBA for async LLM post-training.☆28Nov 5, 2025Updated 3 months ago
- ☆52Jul 18, 2024Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- Exploring Model Kinship for Merging Large Language Models☆27Apr 16, 2025Updated 9 months ago
- ☆22Sep 2, 2025Updated 5 months ago
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆30Dec 8, 2025Updated 2 months ago
- ☆25Jun 28, 2024Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆96May 16, 2025Updated 8 months ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Nov 23, 2022Updated 3 years ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Jan 21, 2025Updated last year
- ☆56Nov 6, 2024Updated last year
- ☆19Jul 15, 2022Updated 3 years ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆27Jul 9, 2024Updated last year
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year
- Official repository for EMNLP'22 paper: Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering☆24Oct 20, 2022Updated 3 years ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆69May 13, 2025Updated 9 months ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆35Jul 16, 2025Updated 6 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Apr 11, 2025Updated 10 months ago
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆112Oct 15, 2024Updated last year
- GenRM-CoT: Data release for verification rationales☆68Oct 16, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year