google / ARC-GENLinks
A Mimetic Procedural Benchmark Generator for the Abstraction and Reasoning Corpus
☆39Updated 3 weeks ago
Alternatives and similar repositories for ARC-GEN
Users that are interested in ARC-GEN are comparing it to the libraries listed below
Sorting:
- My submission to the ARC-AGI-3 Developer Preview Agent Compitition.☆30Updated 4 months ago
- Evaluating majors LLMs on the Abstraction and Reasoning Corpus☆17Updated 2 years ago
- Implementation of SOAR☆46Updated 3 months ago
- ☆15Updated 6 months ago
- ☆600Updated 7 months ago
- Curated collection of community environments☆200Updated this week
- Reverse Engineering the Abstraction and Reasoning Corpus☆329Updated 10 months ago
- Stupid test to check whether MDL principles improve ARC performance☆75Updated this week
- Testing baseline LLMs performance across various models☆332Updated last week
- Automated Design of Agentic Systems☆10Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆14Updated 3 years ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Updated 5 months ago
- Our solution for the arc challenge 2024☆186Updated 6 months ago
- ☆29Updated last year
- Bootstrapping ARC☆153Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- ☆164Updated 4 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆109Updated 10 months ago
- ☆113Updated 3 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆305Updated 3 weeks ago
- ☆28Updated 5 months ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆25Updated last year
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Harbor is a framework for running agent evaluations and creating and using RL environments.☆306Updated this week
- ☆116Updated last week
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆773Updated this week
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 4 months ago
- ☆150Updated 4 months ago
- A python framework to streamline your ARC challenge solutions. From graphical displays to optimized Kaggle submissions☆13Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆346Updated last year