ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆119Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Archon
- ☆50Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆84Updated 3 months ago
- ☆100Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆91Updated 4 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆110Updated 4 months ago
- ☆89Updated 4 months ago
- A simple unified framework for evaluating LLMs☆138Updated this week
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆160Updated last month
- ☆111Updated last month
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆104Updated last month
- Can Language Models Solve Olympiad Programming?☆100Updated 3 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆39Updated 3 months ago
- ☆74Updated 2 weeks ago
- ☆49Updated 6 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆142Updated this week
- ☆61Updated 2 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆106Updated 2 weeks ago
- ☆105Updated this week
- code for training & evaluating Contextual Document Embedding models☆93Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- ☆39Updated 9 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆31Updated last week
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆38Updated 3 weeks ago
- ☆101Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆128Updated this week
- A repository for research on medium sized language models.☆74Updated 5 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆38Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated last month
- Simple and efficient pytorch-native transformer training and inference (batched)☆61Updated 7 months ago