Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆204Mar 7, 2025Updated last year
Alternatives and similar repositories for Archon
Users that are interested in Archon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆116Sep 25, 2024Updated last year
- Simple, flexible configuration in pure Python!☆32Jul 1, 2025Updated last year
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Hydragen: High-Throughput LLM Inference with Shared Prefixes☆52May 10, 2024Updated 2 years ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆56Oct 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆46Feb 8, 2024Updated 2 years ago
- ☆20May 14, 2025Updated last year
- ☆59Jan 28, 2025Updated last year
- Optimizing inference proxy for LLMs☆4,167May 7, 2026Updated last month
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated 2 years ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)☆1,080Mar 24, 2026Updated 3 months ago
- ☆125Jun 2, 2026Updated 3 weeks ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆176Jan 16, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 11 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆153Feb 3, 2025Updated last year
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆664Jan 29, 2026Updated 5 months ago
- ☆40Jun 19, 2024Updated 2 years ago
- ☆15Apr 26, 2025Updated last year
- AWM: Agent Workflow Memory☆444Dec 22, 2025Updated 6 months ago
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 9 months ago
- Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.☆36Sep 26, 2024Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆120Apr 27, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SkyRL: A Modular Full-stack RL Library for LLMs☆2,045Updated this week
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- AllenAI's post-training codebase☆3,775Updated this week
- Your Python AI Coder!☆36May 21, 2025Updated last year
- GRadient-INformed MoE☆264Sep 25, 2024Updated last year
- ☆27Sep 11, 2024Updated last year
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51May 4, 2024Updated 2 years ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 10 months ago
- ☆17Dec 16, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆183Jul 8, 2025Updated 11 months ago
- ☆1,034Dec 17, 2024Updated last year
- Small, simple agent task environments for training and evaluation☆20Nov 1, 2024Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆105Jul 19, 2025Updated 11 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆426Updated this week
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year