jerber / arc-lang-public
☆313 Updated last month
Alternatives and similar repositories for arc-lang-public
Users who are interested in arc-lang-public are comparing it to the repositories listed below.
- SoTA Approach for ARC-AGI 2 ☆157 Updated 4 months ago
- The State Of The Art, intelligence ☆157 Updated 5 months ago
- 🧬 The Huxley-Gödel Machine ☆319 Updated last month
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆138 Updated this week
- ☆253 Updated 10 months ago
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks. ☆1,140 Updated last month
- Digital Red Queen: Adversarial Program Evolution in Core War with LLMs ☆141 Updated last week
- explore token trajectory trees on instruct and base models ☆150 Updated 7 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆435 Updated 2 weeks ago
- Inference-time scaling for LLMs-as-a-judge. ☆325 Updated 2 months ago
- Testing baseline LLMs performance across various models ☆335 Updated last week
- Frontier Models playing the board game Diplomacy. ☆620 Updated 3 weeks ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆315 Updated 6 months ago
- Claude Deep Research config for Claude Code. ☆225 Updated 10 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions" ☆352 Updated last year
- Plotting (entropy, varentropy) for small LMs ☆99 Updated 8 months ago
- ☆90 Updated last year
- A framework for optimizing DSPy programs with RL ☆305 Updated last week
- Real-Time Detection of Hallucinated Entities in Long-Form Generation ☆274 Updated 2 months ago
- Prompts used in the Automated Auditing Blog Post ☆134 Updated 5 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents ☆108 Updated 10 months ago
- ☆189 Updated last year
- Curated collection of community environments ☆204 Updated last week
- ☆32 Updated last year
- ☆139 Updated 10 months ago
- ☆34 Updated 10 months ago
- The history files when recording human interaction while solving ARC tasks ☆118 Updated last week
- ☆105 Updated 5 months ago
- ☆67 Updated 6 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T… ☆338 Updated 4 months ago