ars22/scaling-LLM-math-synthetic-data

Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"

☆32

Alternatives and similar repositories for scaling-LLM-math-synthetic-data

Users who are interested in scaling-LLM-math-synthetic-data compare it to the repositories listed below.

sustcsonglin / disco-pointer
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14 · Aug 25, 2023 · Updated 2 years ago
emorynlp / seq2seq-corenlp
☆13 · Feb 7, 2023 · Updated 3 years ago
LouChao98 / nner_as_parsing
☆16 · Mar 22, 2023 · Updated 3 years ago
rycolab / parsing-as-tagging
☆21 · Nov 19, 2023 · Updated 2 years ago
zsLin177 / SRL-as-GP
☆18 · Mar 10, 2023 · Updated 3 years ago
nuochenpku / LLaMA_Analysis
This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆31 · Jan 13, 2024 · Updated 2 years ago
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆66 · Jul 8, 2024 · Updated 2 years ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20 · Oct 9, 2022 · Updated 3 years ago
THU-KEG / KoLA
[ICLR24] The open-source repo of THU-KEG's KoLA benchmark.
☆57 · Sep 28, 2023 · Updated 2 years ago
robert-lieck / RBN
Recursive Bayesian Networks
☆11 · May 11, 2025 · Updated last year
JamyDon / PLM-based-CGEC-Model-Ensemble
[ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?
☆10 · Dec 15, 2025 · Updated 7 months ago
izhx / uni-rep
Code for embedding and retrieval research.
☆16 · Oct 24, 2023 · Updated 2 years ago
VPeterV / RankSpace-Models
source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"
☆10 · Sep 26, 2022 · Updated 3 years ago
databricks / officeqa
Repository for getting started with the OfficeQA Benchmark.
☆161 · Updated this week
Yinghao-Li / CHMM-ALT
Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"
☆32 · Jun 20, 2023 · Updated 3 years ago
ictnlp / FA-DAT
Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"
☆14 · Mar 1, 2023 · Updated 3 years ago
FadedCosine / Dependency-Guided-Neural-Text-Generation
Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…
☆26 · May 27, 2022 · Updated 4 years ago
rycolab / aflt-f2023
Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)
☆10 · Feb 21, 2023 · Updated 3 years ago
ictnlp / PCFG-NAT
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
☆12 · Jan 4, 2024 · Updated 2 years ago
RakitinDen / pytorch-recursive-gumbel-max-trick
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021
☆14 · Dec 11, 2021 · Updated 4 years ago
ictnlp / HMT
Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"
☆24 · Dec 11, 2023 · Updated 2 years ago
teffland / ner-expected-entity-ratio
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14 · Nov 7, 2022 · Updated 3 years ago
jungokasai / T2R
☆14 · Nov 20, 2022 · Updated 3 years ago
UCSB-NLP-Chang / ThinkPrune
☆46 · Sep 27, 2025 · Updated 9 months ago
ShannonAI / mrc-for-dependency-parsing
☆18 · May 28, 2021 · Updated 5 years ago
hyp1231 / ICLR2023-OpenReviewData
Crawl & visualize ICLR papers and reviews.
☆18 · Nov 5, 2022 · Updated 3 years ago
chaojiang06 / neural-Jacana
This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.
☆20 · Feb 12, 2023 · Updated 3 years ago
acosharma / elita-transformer
Official Repository for Efficient Linear-Time Attention Transformers.
☆18 · Jun 2, 2024 · Updated 2 years ago
sjelassi / transformers_ssm_copy
☆40 · Feb 26, 2024 · Updated 2 years ago
danieldritter / OAPL
☆30 · Feb 24, 2026 · Updated 4 months ago
abacusai / smaug
☆77 · Feb 22, 2024 · Updated 2 years ago
sustcsonglin / mamba-triton
☆52 · Jan 28, 2024 · Updated 2 years ago
ZhaolinGao / REBEL
Reinforcement Learning via Regressing Relative Rewards
☆40 · Dec 12, 2024 · Updated last year
liziniu / GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆58 · May 12, 2025 · Updated last year
Spico197 / MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
☆41 · Sep 29, 2024 · Updated last year
yiqingxyq / RepoST
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24 · Mar 18, 2025 · Updated last year
lyutyuh / structured-span-selector
A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…
☆21 · Jul 11, 2022 · Updated 4 years ago
LAMDA-RL / ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17 · Feb 10, 2024 · Updated 2 years ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58 · Feb 29, 2024 · Updated 2 years ago