Testing baseline LLM performance across various models
☆350Apr 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for arc-agi-benchmarking
Users that are interested in arc-agi-benchmarking are comparing it to the libraries listed below.
- ☆15Jun 19, 2025Updated 11 months ago
- Bootstrapping ARC☆158Nov 20, 2024Updated last year
- ☆236May 14, 2026Updated last week
- My submission to the ARC-AGI-3 Developer Preview Agent Competition.☆77Jan 27, 2026Updated 3 months ago
- Evaluating major LLMs on the Abstraction and Reasoning Corpus☆17Nov 9, 2023Updated 2 years ago
- The Abstraction and Reasoning Corpus☆4,768Apr 4, 2025Updated last year
- Reverse Engineering the Abstraction and Reasoning Corpus☆348Feb 24, 2025Updated last year
- ☆39Mar 30, 2026Updated last month
- ☆27Aug 16, 2025Updated 9 months ago
- ☆40Feb 25, 2024Updated 2 years ago
- Domain Specific Language for the Abstraction and Reasoning Corpus☆334Oct 11, 2024Updated last year
- Draw more samples☆198Jun 23, 2024Updated last year
- ☆18Jul 31, 2025Updated 9 months ago
- Like ARC, but code to generate visual puzzles. 1D puzzles first.☆23Aug 17, 2024Updated last year
- ☆486Jul 18, 2025Updated 10 months ago
- Materials for ConceptARC paper☆118Feb 10, 2026Updated 3 months ago
- Abstract Reasoning with Graph Abstractions (ARGA) implementation☆61Jul 5, 2024Updated last year
- The history files when recording human interaction while solving ARC tasks☆117May 7, 2026Updated 2 weeks ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆349Nov 10, 2025Updated 6 months ago
- Implementation of SOAR☆52Sep 17, 2025Updated 8 months ago
- My solution for the Abstraction and Reasoning Challenge on Kaggle☆10Jun 23, 2024Updated last year
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP☆36Feb 17, 2026Updated 3 months ago
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆90Mar 28, 2025Updated last year
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024)☆21Sep 27, 2024Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- ☆109Jun 30, 2025Updated 10 months ago
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆319Jun 26, 2025Updated 10 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆679Jul 29, 2025Updated 9 months ago
- Language-annotated Abstraction and Reasoning Corpus☆99Mar 24, 2026Updated last month
- ☆36Jul 13, 2023Updated 2 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Simplified class for Zoltraak, a digital content production framework like program codes, images, speeches, presentations, books and vide…☆15Sep 25, 2024Updated last year
- Outputs from the Deep Writer☆16Sep 11, 2024Updated last year
- Universal MCP IdP (Identity Provider) - Support Thousands of Integrations, Zero Maintenance☆30Dec 25, 2025Updated 4 months ago
- withMcp: Turn your API Server into an MCP with 1 line of code☆34Oct 26, 2025Updated 6 months ago
- A GPT with self-similar nested properties☆20Mar 19, 2024Updated 2 years ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,259Aug 27, 2025Updated 8 months ago
- ☆102Mar 8, 2026Updated 2 months ago