jerber / arc-lang-public
☆203 Updated this week
Alternatives and similar repositories for arc-lang-public
Users that are interested in arc-lang-public are comparing it to the libraries listed below
- SoTA Approach for ARC-AGI 2 ☆126 Updated last month
- The State Of The Art, intelligence ☆154 Updated 2 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆100 Updated this week
- Inference-time scaling for LLMs-as-a-judge. ☆304 Updated last month
- ☆233 Updated 7 months ago
- A framework for optimizing DSPy programs with RL ☆208 Updated last week
- explore token trajectory trees on instruct and base models ☆148 Updated 5 months ago
- Claude Deep Research config for Claude Code. ☆223 Updated 7 months ago
- Testing baseline LLMs performance across various models ☆319 Updated 3 weeks ago
- Train your own SOTA deductive reasoning model ☆109 Updated 7 months ago
- look how they massacred my boy ☆63 Updated last year
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co… ☆290 Updated 2 months ago
- Metadspy: The framework for specifying—not programming—language models ☆88 Updated 4 months ago
- ☆170 Updated 10 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? ☆202 Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆90 Updated 3 weeks ago
- ☆123 Updated last year
- ☆124 Updated 10 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆97 Updated 6 months ago
- a curated list of data for reasoning ai ☆140 Updated last year
- ☆62 Updated 3 months ago
- ☆89 Updated 9 months ago
- Plotting (entropy, varentropy) for small LMs ☆98 Updated 5 months ago
- ☆32 Updated last year
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆204 Updated 2 weeks ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents ☆106 Updated 8 months ago
- Open source interpretability artefacts for R1. ☆163 Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆325 Updated last year
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions" ☆349 Updated 10 months ago
- A framework for orchestrating AI agents using a mermaid graph ☆77 Updated last year