Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
☆51 · Jul 10, 2024 · Updated last year
Alternatives and similar repositories for MR-GSM8K
Users who are interested in MR-GSM8K are comparing it to the repositories listed below
- The reproduction of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction ☆22 · May 29, 2024 · Updated last year
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding ☆44 · Mar 15, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- ☆14 · Feb 8, 2024 · Updated 2 years ago
- [SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vect… ☆11 · Oct 1, 2024 · Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 · May 5, 2024 · Updated last year
- The data for the CRASS-benchmark ☆16 · Oct 24, 2022 · Updated 3 years ago
- Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration ☆17 · Jan 7, 2021 · Updated 5 years ago
- Simple LLM inference server ☆20 · Jun 13, 2024 · Updated last year
- Explore how Flux Dev responds when you change the strengths of layers in the model. ☆21 · Sep 20, 2024 · Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 · Feb 9, 2024 · Updated 2 years ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" ☆51 · Oct 31, 2024 · Updated last year
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model ☆13 · Sep 25, 2024 · Updated last year
- ☆35 · Jan 12, 2026 · Updated last month
- ☆35 · Mar 22, 2025 · Updated 11 months ago
- FuseAI Project ☆590 · Jan 25, 2025 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- [CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields ☆126 · Jul 7, 2023 · Updated 2 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control. ☆25 · Nov 29, 2025 · Updated 3 months ago
- Experimental sampler to make LLMs more creative ☆31 · Aug 2, 2023 · Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆124 · Sep 9, 2024 · Updated last year
- Experimental method to use reference video to drive motion in generations without training in ComfyUI. ☆37 · Apr 9, 2024 · Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆81 · Jan 18, 2024 · Updated 2 years ago
- Token Omission Via Attention ☆127 · Oct 13, 2024 · Updated last year
- Evaluate the Quality of Critique ☆36 · Jun 1, 2024 · Updated last year
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models" ☆37 · Jan 3, 2024 · Updated 2 years ago
- Fivetran's Salesforce source dbt package ☆13 · Oct 1, 2025 · Updated 5 months ago
- Branch Metrics Win32/C++ SDK ☆10 · Jun 10, 2025 · Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- The official code for "GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning" ☆29 · Jan 28, 2026 · Updated last month
- ☆11 · Sep 17, 2024 · Updated last year
- PyTorch implementation for "Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation" ☆10 · Apr 11, 2024 · Updated last year
- [3D GeoInfo 2025] RoofSense: A Multimodal Semantic Segmentation Dataset for Roofing Material Classification ☆10 · Jul 3, 2025 · Updated 8 months ago
- Basic operations prototype/syntax for developers ☆12 · Mar 12, 2023 · Updated 2 years ago
- Official Repo for "Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields" ☆36 · May 27, 2025 · Updated 9 months ago
- This is a demo project to compare two web scraping frameworks, Playwright and Selenium, using the new pipelining tool Dagster ☆15 · Sep 9, 2021 · Updated 4 years ago
- Memoir+ a persona memory extension for Text Gen Web UI. ☆224 · Feb 5, 2026 · Updated last month
- ☆84 · Nov 10, 2025 · Updated 3 months ago