Testing baseline LLM performance across various models
☆340Mar 20, 2026Updated last week
Alternatives and similar repositories for arc-agi-benchmarking
Users that are interested in arc-agi-benchmarking are comparing it to the libraries listed below.
- ☆676May 22, 2025Updated 10 months ago
- ☆15Jun 19, 2025Updated 9 months ago
- My submission to the ARC-AGI-3 Developer Preview Agent Competition.☆47Jan 27, 2026Updated 2 months ago
- ☆162Mar 19, 2026Updated last week
- Bootstrapping ARC☆156Nov 20, 2024Updated last year
- The Abstraction and Reasoning Corpus☆4,737Apr 4, 2025Updated 11 months ago
- ☆30Aug 7, 2025Updated 7 months ago
- Reverse Engineering the Abstraction and Reasoning Corpus☆336Feb 24, 2025Updated last year
- ☆39Feb 25, 2024Updated 2 years ago
- ☆27Aug 16, 2025Updated 7 months ago
- Domain Specific Language for the Abstraction and Reasoning Corpus☆324Oct 11, 2024Updated last year
- Draw more samples☆198Jun 23, 2024Updated last year
- Like ARC, but code to generate visual puzzles. 1D puzzles first.☆22Aug 17, 2024Updated last year
- ☆19Jul 31, 2025Updated 7 months ago
- Run GEPA on your favorite non-python libraries.☆34Jan 22, 2026Updated 2 months ago
- Materials for ConceptARC paper☆117Feb 10, 2026Updated last month
- ☆485Jul 18, 2025Updated 8 months ago
- ☆51Mar 13, 2026Updated 2 weeks ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆345Nov 10, 2025Updated 4 months ago
- The history files when recording human interaction while solving ARC tasks☆117Mar 2, 2026Updated 3 weeks ago
- Implementation of SOAR☆51Sep 17, 2025Updated 6 months ago
- My solution for the Abstraction and Reasoning Challenge on Kaggle☆10Jun 23, 2024Updated last year
- An Open Source SLM Trained for MCP☆23May 18, 2025Updated 10 months ago
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆86Mar 28, 2025Updated last year
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP☆36Feb 17, 2026Updated last month
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024)☆20Sep 27, 2024Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆318Jun 26, 2025Updated 9 months ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- ☆108Jun 30, 2025Updated 9 months ago
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆70Aug 30, 2024Updated last year
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- A selective knowledge distillation algorithm for efficient speculative decoders☆36Nov 27, 2025Updated 4 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆651Jul 29, 2025Updated 8 months ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Universal MCP IdP (Identity Provider) - Support Thousands of Integrations, Zero Maintenance☆30Dec 25, 2025Updated 3 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- A GPT with self-similar nested properties☆20Mar 19, 2024Updated 2 years ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,237Aug 27, 2025Updated 7 months ago
- ☆100Mar 8, 2026Updated 3 weeks ago