JIA-Lab-research/MR-GSM8K

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

☆52

Alternatives and similar repositories for MR-GSM8K

Users who are interested in MR-GSM8K are comparing it to the libraries listed below.

JIA-Lab-research / Mr-Ben
This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"
☆51 · Oct 31, 2024 · Updated last year
JIA-Lab-research / GroupContrast
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
☆45 · Mar 15, 2024 · Updated 2 years ago
camenduru / LGM-replicate
☆14 · Feb 8, 2024 · Updated 2 years ago
AlignInc / aligner-replication
A reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"
☆21 · May 29, 2024 · Updated 2 years ago
apergo-ai / CRASS-data-set
The data for the CRASS-benchmark
☆17 · Oct 24, 2022 · Updated 3 years ago
ekinakyurek / gpt3-arithmetic
Scratchpad/Chain-of-Thought Prompts
☆12 · Jun 6, 2022 · Updated 4 years ago
choidami / inductive-oocr
☆16 · Mar 22, 2025 · Updated last year
apple / ml-entity-deduction-arena
☆39 · May 31, 2024 · Updated 2 years ago
sabithsn / APPDIA-Discourse-Style-Transfer
Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…
☆13 · Sep 8, 2022 · Updated 3 years ago
Ethan-TZ / EulerFormer
[SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vect…
☆11 · Oct 1, 2024 · Updated last year
ash-neupane / multi-token-pred
Train toy models using multi-token prediction objective
☆14 · Apr 18, 2026 · Updated 3 months ago
EnVision-Research / DDSM
Denoising Diffusion Step-aware Models (ICLR2024)
☆62 · Feb 6, 2024 · Updated 2 years ago
nlp-waseda / mtl-eadrg
Emotion-Aware Dialogue Response Generation by Multi-Task Learning
☆13 · Jan 22, 2022 · Updated 4 years ago
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆24 · Feb 9, 2024 · Updated 2 years ago
cultural-csk / candle
Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)
☆11 · Feb 15, 2024 · Updated 2 years ago
salavi / Clever_Hans_or_N-ToM
☆12 · May 6, 2024 · Updated 2 years ago
dice-group / KG-NMT
Knowledge Graph-augmented NMT
☆11 · Sep 20, 2021 · Updated 4 years ago
aflah02 / Easy-Data-Augmentation-Implementation
My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…
☆12 · Mar 18, 2022 · Updated 4 years ago
TristanThrush / i-am-a-strange-dataset
Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆46 · Jan 11, 2024 · Updated 2 years ago
THUDM / ChatGLM-Math
☆82 · Apr 18, 2024 · Updated 2 years ago
JIA-Lab-research / DecoupleNet
Official implementation for our ECCV 2022 paper "DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation"
☆36 · Jan 3, 2023 · Updated 3 years ago
JIA-Lab-research / DiffComplete
Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion"
☆130 · Aug 14, 2024 · Updated last year
TianheL / LM-Implicit-Reasoning
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆18 · Mar 11, 2025 · Updated last year
JiazhengZhang / AgentV-RL
☆15 · Apr 17, 2026 · Updated 3 months ago
YuxiangChai / A3
☆35 · Jan 12, 2026 · Updated 6 months ago
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆83 · Jan 18, 2024 · Updated 2 years ago
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21 · May 28, 2024 · Updated 2 years ago
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆124 · Sep 9, 2024 · Updated last year
fishiatee / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆11 · Sep 14, 2025 · Updated 10 months ago
Cornell-RelaxML / qtip
☆181 · Jun 22, 2025 · Updated last year
JIA-Lab-research / Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
☆159 · Jul 23, 2024 · Updated 2 years ago
Edward-Sun / gpt-accelera
Simple and efficient pytorch-native transformer training and inference (batched)
☆78 · Apr 2, 2024 · Updated 2 years ago
SalesforceAIResearch / GemFilter
☆84 · Jun 2, 2026 · Updated last month
yrf1 / LLM-MassiveMulticultureNormsKnowledge-NCLB
☆20 · Mar 12, 2025 · Updated last year
KaiNylund / lm-weights-encode-time
☆68 · Aug 16, 2024 · Updated last year
WadeYin9712 / GeoMLAMA
☆15 · Oct 24, 2022 · Updated 3 years ago
princeton-nlp / Cognac
Repo for paper: Controllable Text Generation with Language Constraints
☆20 · Jun 20, 2023 · Updated 3 years ago
WooooDyy / LLM-Reverse-Curriculum-RL
Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…
☆116 · Feb 9, 2024 · Updated 2 years ago
jcottaar / seismic
Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)
☆13 · Aug 11, 2025 · Updated 11 months ago