☆100Feb 24, 2026Updated last week
Alternatives and similar repositories for SnakeBench
Users who are interested in SnakeBench are comparing it to the repositories listed below
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- ☆27Aug 16, 2025Updated 6 months ago
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆21Sep 18, 2025Updated 5 months ago
- ☆23Apr 4, 2024Updated last year
- ☆55Nov 22, 2024Updated last year
- A python framework to streamline your ARC challenge solutions. From graphical displays to optimized Kaggle submissions☆13Oct 17, 2024Updated last year
- ☆15Jun 19, 2025Updated 8 months ago
- Abstraction and Reasoning Corpus☆14Nov 22, 2022Updated 3 years ago
- Give langchain access to the terminal☆32Apr 10, 2023Updated 2 years ago
- Model Context Protocol (MCP) server to capture images from an OpenCV-compatible webcam or video source☆16Mar 28, 2025Updated 11 months ago
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆18Mar 15, 2025Updated 11 months ago
- Evaluating major LLMs on the Abstraction and Reasoning Corpus☆17Nov 9, 2023Updated 2 years ago
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- Training tiny models to prove hard theorems☆41Feb 15, 2026Updated 2 weeks ago
- A quick implementation of diffusion language models.☆48Oct 11, 2025Updated 4 months ago
- ☆19May 11, 2024Updated last year
- ☆20Nov 4, 2025Updated 4 months ago
- ☆60Jan 28, 2025Updated last year
- My writings about ARC (Abstraction and Reasoning Corpus)☆91Dec 9, 2025Updated 2 months ago
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- A Mimetic Procedural Benchmark Generator for the Abstraction and Reasoning Corpus☆41Jan 24, 2026Updated last month
- Lambda durable functions SDK, Testing SDK and fully functional examples☆72Updated this week
- ARC gym: a data generation framework for the Abstraction & Reasoning Corpus☆25Updated this week
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Feb 11, 2026Updated 3 weeks ago
- Domain Specific Language for the Abstraction and Reasoning Corpus☆321Oct 11, 2024Updated last year
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,042Apr 27, 2025Updated 10 months ago
- ☆19Jun 29, 2020Updated 5 years ago
- Reverse Engineering the Abstraction and Reasoning Corpus☆333Feb 24, 2025Updated last year
- An efficient GRPO training util.☆54Jun 13, 2025Updated 8 months ago
- Abstract Reasoning with Graph Abstractions (ARGA) implementation☆61Jul 5, 2024Updated last year
- ☆76Feb 18, 2026Updated 2 weeks ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆111May 22, 2025Updated 9 months ago
- ☆67Jul 11, 2025Updated 7 months ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆136Oct 27, 2025Updated 4 months ago
- Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge☆163Jul 10, 2025Updated 7 months ago
- ☆27Jul 9, 2024Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated 10 months ago
- A lightweight code assistant with tool-using capabilities built on HuggingFace's smolagents.☆41Jun 11, 2025Updated 8 months ago