Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆189Mar 7, 2025Updated 11 months ago
Alternatives and similar repositories for Archon
Users that are interested in Archon are comparing it to the libraries listed below
Sorting:
- ☆112Sep 25, 2024Updated last year
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- ☆21Jul 25, 2025Updated 7 months ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51May 4, 2024Updated last year
- Optimizing inference proxy for LLMs☆3,342Jan 28, 2026Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆176Jan 16, 2025Updated last year
- ☆123Feb 21, 2025Updated last year
- AWM: Agent Workflow Memory☆397Dec 22, 2025Updated 2 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Hydragen: High-Throughput LLM Inference with Shared Prefixes☆48May 10, 2024Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Apr 9, 2025Updated 10 months ago
- ☆41Jun 19, 2024Updated last year
- ☆17Jan 9, 2025Updated last year
- GRadient-INformed MoE☆264Sep 25, 2024Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆154Feb 3, 2025Updated last year
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,628Updated this week
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Jun 5, 2025Updated 8 months ago
- ☆18Jun 3, 2024Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- ☆24Dec 11, 2024Updated last year
- ☆33Jul 9, 2025Updated 7 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆633Jan 29, 2026Updated last month
- ☆46Feb 8, 2024Updated 2 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- ☆27Sep 11, 2024Updated last year
- ☆17Dec 21, 2023Updated 2 years ago
- ☆1,033Dec 17, 2024Updated last year
- AllenAI's post-training codebase☆3,592Feb 24, 2026Updated last week
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆180Jul 8, 2025Updated 7 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆203Jul 17, 2024Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆186May 25, 2025Updated 9 months ago
- Your Python AI Coder!☆36May 21, 2025Updated 9 months ago
- ☆28Oct 2, 2025Updated 5 months ago
- Predict the performance of LLM inference services☆21Sep 18, 2025Updated 5 months ago
- A compact LLM pretrained in 9 days by using high quality data☆339Apr 9, 2025Updated 10 months ago