meta-math / MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

☆447

Alternatives and similar repositories for MetaMath

Users who are interested in MetaMath are comparing it to the repositories listed below.

RLHFlow / Online-RLHF
A recipe for online RLHF and online iterative DPO.
☆533 Updated 9 months ago
MARIO-Math-Reasoning / Super_MARIO
☆342 Updated 4 months ago
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆266 Updated last year
TIGER-AI-Lab / MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
☆377 Updated last year
mathllm / MathCoder
[MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.
☆326 Updated last week
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆352 Updated last year
eddycmu / demystify-long-cot
☆323 Updated 4 months ago
dvlab-research / Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆384 Updated 9 months ago
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆259 Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆439 Updated last year
Cohere-Labs-Community / parameter-efficient-moe
☆271 Updated last year
OpenBMB / OlympiadBench
[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…
☆172 Updated 4 months ago
QwenLM / AutoIF
☆312 Updated last year
HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆440 Updated last year
Ledzy / BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
☆272 Updated 7 months ago
YuxiXie / MCTS-DPO
Source code for Self-Evaluation Guided MCTS for online DPO.
☆327 Updated last year
GAIR-NLP / LIMR
☆211 Updated 8 months ago
tongyx361 / Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…
☆143 Updated last year
sail-sg / oat-zero
A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.
☆247 Updated 6 months ago
OpenBMB / Eurus
☆319 Updated last year
lupantech / MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
☆342 Updated 3 weeks ago
kanishkg / cognitive-behaviors
☆210 Updated 7 months ago
RLHFlow / Minimal-RL
☆243 Updated 5 months ago
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆643 Updated 4 months ago
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆572 Updated 10 months ago
iiis-ai / cumulative-reasoning
[TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371)
☆302 Updated 2 months ago
ezelikman / STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
☆214 Updated 2 years ago
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆522 Updated 9 months ago
princeton-nlp / LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
☆496 Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
☆477 Updated last year