iiis-ai / IterativeQuestionComposing
Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09003)
☆19Updated 3 months ago
Alternatives and similar repositories for IterativeQuestionComposing:
Users that are interested in IterativeQuestionComposing are comparing it to the libraries listed below
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆54Updated 8 months ago
- ☆29Updated 2 months ago
- Evaluate the Quality of Critique☆35Updated 9 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆53Updated 3 months ago
- Trending projects & awesome papers about data-centric llm studies.☆33Updated 2 months ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Updated 2 years ago
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆26Updated 5 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated 11 months ago
- ☆24Updated 6 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆25Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 4 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆29Updated 9 months ago
- Towards Systematic Measurement for Long Text Quality☆33Updated 6 months ago
- Analyzing LLM Alignment via Token distribution shift☆15Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated 2 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆51Updated 9 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆28Updated 10 months ago
- ☆14Updated 8 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆35Updated 8 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆24Updated last year
- ☆34Updated 11 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated last year
- ☆59Updated 6 months ago
- The official repository of the Omni-MATH benchmark.☆74Updated 2 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆24Updated last year
- ☆20Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆72Updated 9 months ago