jerber / arc-lang-publicLinks
☆164Updated last week
Alternatives and similar repositories for arc-lang-public
Users that are interested in arc-lang-public are comparing it to the libraries listed below
Sorting:
- SoTA Approach for ARC-AGI 2☆97Updated 3 weeks ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆97Updated this week
- Inference-time scaling for LLMs-as-a-judge.☆300Updated last week
- explore token trajectory trees on instruct and base models☆133Updated 4 months ago
- ☆233Updated 7 months ago
- The State Of The Art, intelligence☆152Updated last month
- ☆166Updated 9 months ago
- Train your own SOTA deductive reasoning model☆107Updated 7 months ago
- ☆32Updated last year
- look how they massacred my boy☆63Updated 11 months ago
- ☆999Updated this week
- Testing baseline LLMs performance across various models☆311Updated last week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆449Updated last year
- A framework for optimizing DSPy programs with RL☆185Updated 2 weeks ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 4 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆71Updated last year
- The history files when recording human interaction while solving ARC tasks☆116Updated this week
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆290Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆322Updated 11 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆312Updated 3 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆274Updated 11 months ago
- ☆123Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆64Updated 7 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆57Updated 9 months ago
- ☆135Updated 6 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆265Updated last month
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆72Updated 7 months ago
- Claude Deep Research config for Claude Code.☆220Updated 6 months ago
- ☆60Updated 2 months ago
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year