ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆190Updated 9 months ago
Alternatives and similar repositories for Archon
Users that are interested in Archon are comparing it to the libraries listed below
- ☆125Updated 10 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆174Updated 11 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- ☆136Updated 9 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆152Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap☆90Updated last year
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆55Updated 5 months ago
- Storing long contexts in tiny caches with self-study☆226Updated 2 weeks ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Updated 5 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆405Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 9 months ago
- Replicating O1 inference-time scaling laws☆91Updated last year
- ☆59Updated 10 months ago
- Curated collection of community environments☆195Updated last week
- ☆90Updated this week
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- ☆111Updated last year
- ☆207Updated last week
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆221Updated last week
- Evaluating LLMs with fewer examples☆170Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆73Updated 8 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Updated last year
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆229Updated this week
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆330Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale☆111Updated 8 months ago
- A simple unified framework for evaluating LLMs☆257Updated 8 months ago