microsoft/LEMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/LEMA)

microsoft / LEMA

official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"

☆60

Alternatives and similar repositories for LEMA

Users that are interested in LEMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yiqingxyq / RepoST
View on GitHub
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24Mar 18, 2025Updated last year
microsoft / Do-You-See-Me
View on GitHub
☆13Jun 21, 2025Updated last year
open-compass / CIBench
View on GitHub
Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "
☆15Jul 19, 2024Updated 2 years ago
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
TIGER-AI-Lab / AceCoder
View on GitHub
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆100Apr 9, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Sphere-AI-Lab / FormalMATH-Bench
View on GitHub
Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>
☆75Jan 8, 2026Updated 6 months ago
GAIR-NLP / OlympicArena
View on GitHub
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆106Mar 6, 2025Updated last year
jtonglet / Numerical-Hybrid-QA-Literature
View on GitHub
A list of Numerical Multimodal reasoning papers and their implementation
☆11May 13, 2024Updated 2 years ago
GAIR-NLP / BeHonest
View on GitHub
BeHonest: Benchmarking Honesty in Large Language Models
☆35Aug 15, 2024Updated last year
LeiLiLab / HardTestGen
View on GitHub
☆17Jan 27, 2026Updated 5 months ago
HillZhang1999 / RobustGEC
View on GitHub
Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)
☆17Jan 23, 2024Updated 2 years ago
liziniu / cold_start_rl
View on GitHub
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?
☆20Mar 9, 2025Updated last year
THUKElab / Visual-C3
View on GitHub
Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters
☆17May 30, 2024Updated 2 years ago
wiio12 / POETRY
View on GitHub
Code for the paper: Proving Theorems Recursively
☆12May 23, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MARIO-Math-Reasoning / Super_MARIO
View on GitHub
☆341Jun 5, 2025Updated last year
He-Ren / OJBench
View on GitHub
☆32Feb 28, 2026Updated 4 months ago
cicero225 / llm_pokemon_scaffold
View on GitHub
☆34May 31, 2025Updated last year
bin123apple / MACM
View on GitHub
[NeurIPS 2024] MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
☆94Jul 24, 2024Updated last year
DAMO-NLP-SG / RemeMo
View on GitHub
[EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning
☆17Oct 31, 2023Updated 2 years ago
yzhangcs / ctc-copy
View on GitHub
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
☆20Oct 17, 2023Updated 2 years ago
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
psunlpgroup / VisOnlyQA
View on GitHub
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…
☆29Jul 9, 2025Updated last year
ZuyiZhou / Awesome-Interpretable-Cross-modal-Reasoning
View on GitHub
A Survey on Interpretable Cross-modal Reasoning
☆15Oct 12, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mukhal / GRACE
View on GitHub
[EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning
☆50Oct 11, 2024Updated last year
microsoft / rho
View on GitHub
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆470Apr 18, 2024Updated 2 years ago
Dahoas / QDSyntheticData
View on GitHub
☆14Aug 15, 2024Updated last year
Essential-AI / reflection
View on GitHub
☆50Apr 11, 2025Updated last year
zitian-gao / SC-MCTS
View on GitHub
Interpretable Contrastive Monte Carlo Tree Search Reasoning
☆52Nov 9, 2024Updated last year
bammt / Learn-to-check
View on GitHub
the datasets of our paper
☆11Feb 26, 2024Updated 2 years ago
AlexeySorokin / EditScorer
View on GitHub
The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"
☆21Dec 14, 2022Updated 3 years ago
ying-hui-he / Hi-ToM_dataset
View on GitHub
☆21Oct 11, 2025Updated 9 months ago
microsoft / text-to-sql-schema-expansion-generalization
View on GitHub
Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion
☆13Jul 26, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
StigLidu / CodeGym
View on GitHub
[ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"
☆32Oct 14, 2025Updated 9 months ago
YuxiXie / MCTS-DPO
View on GitHub
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331Jan 29, 2026Updated 5 months ago
zhliu0106 / probing-lm-data
View on GitHub
Official Implementation of "Probing Language Models for Pre-training Data Detection"
☆20Dec 4, 2024Updated last year
li-aolong / TemplateGEC
View on GitHub
ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template
☆23Jul 10, 2023Updated 3 years ago
OpenBMB / Eurus
View on GitHub
☆322Sep 18, 2024Updated last year
genrm-star / genrm-critiques
View on GitHub
GenRM-CoT: Data release for verification rationales
☆68Oct 16, 2024Updated last year
JIA-Lab-research / Step-DPO
View on GitHub
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆398Jan 19, 2025Updated last year