Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
☆52Jul 10, 2024Updated last year
Alternatives and similar repositories for MR-GSM8K
Users that are interested in MR-GSM8K are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A minimal, educational HEVC (H.265) encoder written in Python.☆53Feb 23, 2026Updated 4 months ago
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆44Mar 15, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration☆17Jan 7, 2021Updated 5 years ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆41May 31, 2024Updated 2 years ago
- CURRENNT -- CUDA-enabled machine learning library for recurrent neural network☆16Feb 20, 2020Updated 6 years ago
- ☆16Mar 22, 2025Updated last year
- QAQ: Quality Adaptive Quantization for LLM KV Cache☆55Mar 27, 2024Updated 2 years ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning☆13Jan 22, 2022Updated 4 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆285Aug 4, 2025Updated 11 months ago
- ☆12May 6, 2024Updated 2 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Feb 15, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆46Jan 11, 2024Updated 2 years ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆18Mar 11, 2025Updated last year
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- ☆34Sep 19, 2025Updated 9 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆54Jun 24, 2024Updated 2 years ago
- Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Ne…☆137Jul 2, 2024Updated 2 years ago
- ☆13Jul 14, 2024Updated last year
- Simple LLM inference server☆20Jun 13, 2024Updated 2 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆82Jan 18, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆178Jun 22, 2025Updated last year
- ☆11Apr 4, 2023Updated 3 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- ☆20Jan 26, 2026Updated 5 months ago
- [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs☆158Jul 23, 2024Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Apr 2, 2024Updated 2 years ago
- ☆84Jun 2, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- Collection of awesome Continual Test-Time Adaptation methods☆24Jun 4, 2024Updated 2 years ago
- Repo for paper: Controllable Text Generation with Language Constraints☆20Jun 20, 2023Updated 3 years ago
- A macOS application for accessing the output of the SimpleAnalytics package on the desktop.☆11Oct 8, 2023Updated 2 years ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆13Aug 11, 2025Updated 10 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆30Dec 10, 2024Updated last year
- ☆15Oct 24, 2022Updated 3 years ago