QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆26Feb 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for qalign
Users that are interested in qalign are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- ☆24Apr 3, 2025Updated 11 months ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆42Jul 21, 2025Updated 7 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆64Jul 6, 2025Updated 7 months ago
- Lego for GRPO☆30May 27, 2025Updated 9 months ago
- ☆29Feb 24, 2025Updated last year
- A simple lightweight Model Context Protocol (MCP) server integration framework☆17Jan 23, 2026Updated last month
- A comprehensive React Native starter template built with Expo. It includes reusable UI components, Poppins font setup, NativeWind, Fireba…☆23Updated this week
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆14Nov 11, 2025Updated 3 months ago
- AuraMatrix is personality analysis web which using llm to do evaluation. I have made this for Gyanotsav-2025 to show different ways to ut…☆11Dec 22, 2025Updated 2 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆42Apr 22, 2025Updated 10 months ago
- A lightweight, open, and extensible multi-LLM interaction studio.☆75Oct 28, 2025Updated 4 months ago
- Go-based LLM proxy for cost tracking and rate limiting☆16Oct 9, 2025Updated 4 months ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar…☆28May 17, 2025Updated 9 months ago
- Primus-SaFE(Stability and Fault Endurance)☆52Updated this week
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you.☆23Nov 20, 2025Updated 3 months ago
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 4 months ago
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆10Mar 24, 2025Updated 11 months ago
- Fully local web research and report writing assistant☆39Jan 28, 2025Updated last year
- Live reloadable remote function management in Python☆11May 28, 2025Updated 9 months ago
- AI Tasks. A LLM integrated agent orchestration tool for Liferay.☆14May 16, 2025Updated 9 months ago
- ☆34Sep 22, 2025Updated 5 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- LLM Skirmish☆44Feb 3, 2026Updated last month
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- Curiosity driven exploration of your complex system 👀☆43Jan 30, 2025Updated last year
- React Native, Right Now (rn-rn)☆18Sep 2, 2025Updated 6 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- An open source deep research clone. AI Agent (Local LLM or Gemini) that reasons large amounts of web data extracted with SwiftSoup.☆13Feb 10, 2025Updated last year
- A Discord bot to retrieve Shopify Orders and Statistics☆10Dec 9, 2025Updated 2 months ago
- Shakey OS Mobile AI Framework for React Native allowing people to build React Native apps for IOS and Android with AI tooling and wallet …☆28Feb 3, 2025Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- Extract annotated misspellings from MIMIC-III.☆13Dec 17, 2020Updated 5 years ago
- ☆14Apr 4, 2025Updated 11 months ago
- ☆25Aug 19, 2025Updated 6 months ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- "Open-source toolkit (Python Library, Registry API, CLI) for secure, decentralized AI agent interoperability using A2A/MCP."☆14May 10, 2025Updated 9 months ago
- A library for analyzing Quantum Error Correction Codes from their Tanner Graphs.☆14Aug 23, 2025Updated 6 months ago
- Apache Geronimo Microprofile OpenAPI Implementation☆12Nov 15, 2022Updated 3 years ago