jerber / arc-lang-publicLinks
☆313Updated 2 months ago
Alternatives and similar repositories for arc-lang-public
Users that are interested in arc-lang-public are comparing it to the libraries listed below
Sorting:
- SoTA Approach for ARC-AGI 2☆159Updated 4 months ago
- The State Of The Art, intelligence☆157Updated 6 months ago
- explore token trajectory trees on instruct and base models☆150Updated 8 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆151Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆636Updated last month
- Inference-time scaling for LLMs-as-a-judge.☆328Updated 3 months ago
- 🧬 The Huxley-Gödel Machine☆324Updated this week
- A framework for optimizing DSPy programs with RL☆313Updated last month
- Prompts used in the Automated Auditing Blog Post☆137Updated 6 months ago
- Plotting (entropy, varentropy) for small LMs☆99Updated 8 months ago
- Testing baseline LLMs performance across various models☆336Updated this week
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- ☆67Updated 7 months ago
- Digital Red Queen: Adversarial Program Evolution in Core War with LLMs☆176Updated 3 weeks ago
- look how they massacred my boy☆63Updated last year
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 11 months ago
- Lightly-reviewed collection of community environments☆212Updated this week
- A framework for orchestrating AI agents using a mermaid graph☆76Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆318Updated 7 months ago
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.☆1,197Updated last month
- ☆190Updated last year
- Benchmark for LLMs playing full press diplomacy☆56Updated 11 months ago
- Claude Deep Research config for Claude Code.☆226Updated 10 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆100Updated 9 months ago
- ☆125Updated last year
- ☆32Updated last year
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆278Updated 2 months ago
- ☆134Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated last year
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆458Updated last year