arcprize/arc-agi-benchmarking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/arcprize/arc-agi-benchmarking)

arcprize / arc-agi-benchmarking

Testing baseline LLMs performance across various models

☆351

Alternatives and similar repositories for arc-agi-benchmarking

Users that are interested in arc-agi-benchmarking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arcprize / ARC-AGI-2
View on GitHub
☆726May 22, 2025Updated last year
MohamedOsman1998 / deep-learning-for-arc
View on GitHub
☆15Jun 19, 2025Updated last year
xu3kev / BARC
View on GitHub
Bootstrapping ARC
☆162Nov 20, 2024Updated last year
michaelhodel / re-arc
View on GitHub
Reverse Engineering the Abstraction and Reasoning Corpus
☆355Feb 24, 2025Updated last year
fchollet / ARC-AGI
View on GitHub
The Abstraction and Reasoning Corpus
☆4,804Apr 4, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
KSB21ST / MINI-ARC
View on GitHub
☆40Feb 25, 2024Updated 2 years ago
alxndrTL / ARC_LLMs
View on GitHub
Evaluating majors LLMs on the Abstraction and Reasoning Corpus
☆17Nov 9, 2023Updated 2 years ago
michaelhodel / arc-dsl
View on GitHub
Domain Specific Language for the Abstraction and Reasoning Corpus
☆342Oct 11, 2024Updated last year
flowersteam / SOAR
View on GitHub
Implementation of SOAR
☆55Sep 17, 2025Updated 10 months ago
rgreenblatt / arc_draw_more_samples_pub
View on GitHub
Draw more samples
☆198Jun 23, 2024Updated 2 years ago
Le-Gris / h-arc
View on GitHub
☆45Mar 30, 2026Updated 3 months ago
arcprize / ARC-AGI-3-Agents
View on GitHub
☆287May 28, 2026Updated last month
apple / ml-reversal-blessing
View on GitHub
☆17Jul 31, 2025Updated 11 months ago
neurallambda / arc-like
View on GitHub
Like ARC, but code to generate visual puzzles. 1D puzzles first.
☆23Aug 17, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
khalil-research / ARGA-AAAI23
View on GitHub
Abstract Reasoning with Graph Abstractions (ARGA) implementation
☆61Jul 5, 2024Updated 2 years ago
victorvikram / ConceptARC
View on GitHub
Materials for ConceptARC paper
☆119Feb 10, 2026Updated 5 months ago
DriesSmit / ARC3-solution
View on GitHub
My submission to the ARC-AGI-3 Developer Preview Agent Compitition.
☆92Jan 27, 2026Updated 5 months ago
latticetower / kaggle-arc
View on GitHub
my solution for Abstaction and reasoning challenge on kaggle
☆10Jun 23, 2024Updated 2 years ago
microsoft / chemistry-qa
View on GitHub
☆15Nov 6, 2020Updated 5 years ago
neoneye / ARC-Interactive-History-Dataset
View on GitHub
The history files when recording human interaction while solving ARC tasks
☆118Jun 28, 2026Updated 3 weeks ago
jerber / arc-lang-public
View on GitHub
☆314Dec 12, 2025Updated 7 months ago
ekinakyurek / marc
View on GitHub
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆353Nov 10, 2025Updated 8 months ago
neoneye / arc-dataset-collection
View on GitHub
Multiple datasets for ARC (Abstraction and Reasoning Corpus)
☆91Mar 28, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
epang080516 / arc_agi
View on GitHub
SoTA Approach for ARC-AGI 2
☆159Sep 16, 2025Updated 10 months ago
aw31 / openai-imo-2025-proofs
View on GitHub
☆485Jul 18, 2025Updated last year
ConfeitoHS / arcle
View on GitHub
A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)
☆73Aug 30, 2024Updated last year
megagonlabs / holobench
View on GitHub
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12Feb 25, 2025Updated last year
lalalune / gptcoder
View on GitHub
RAG Agent for the ARC AGI Challenge
☆20Jul 1, 2024Updated 2 years ago
zoecarver / saturn-arc
View on GitHub
☆27Aug 16, 2025Updated 11 months ago
aidanmclaughlin / AidanBench
View on GitHub
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆319Jun 26, 2025Updated last year
beetree / ARC-AGI
View on GitHub
☆78May 31, 2026Updated last month
samacqua / LARC
View on GitHub
Language-annotated Abstraction and Reasoning Corpus
☆99Mar 24, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
eric-mitchell / macaw-min
View on GitHub
Clean, extensible implementation of MACAW [ICML 2021]
☆12Dec 7, 2021Updated 4 years ago
arcprize / ARCEngine
View on GitHub
Simple Python Game Engine
☆27Jan 29, 2026Updated 5 months ago
GarrettG-AI / deep-writer-outputs
View on GitHub
Outputs from the Deep Writer
☆16Sep 11, 2024Updated last year
evintunador / FractalFormer
View on GitHub
A GPT with self-similar nested properties
☆20Mar 19, 2024Updated 2 years ago
KbsdJames / omni-math-rule
View on GitHub
The rule-based evaluation subset and code implementation of Omni-MATH
☆28Dec 23, 2024Updated last year
murxla / murxla
View on GitHub
A model-based API Fuzzer for SMT Solvers.
☆16May 20, 2026Updated 2 months ago
arcprize / ARC-AGI
View on GitHub
ARC-AGI Toolkit
☆64Jun 10, 2026Updated last month