vsubramaniam851/multiagent-ft



☆234

Alternatives and similar repositories for multiagent-ft

Users that are interested in multiagent-ft are comparing it to the libraries listed below.


Shalev-Lifshitz / MultiAgentVerification
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
☆33Mar 1, 2025Updated last year
zou-group / sirius
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
☆108Dec 1, 2025Updated 7 months ago
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86May 21, 2025Updated last year
zhentingqi / rStar
☆972Jan 23, 2025Updated last year
cyzus / thoughtsculpt
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13Dec 13, 2024Updated last year
HITsz-TMG / ICL-State-Vector
☆12Jul 4, 2024Updated 2 years ago
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,221Jan 30, 2025Updated last year
WooooDyy / MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆55Nov 29, 2024Updated last year
Aloriosa / srmt
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,865Mar 18, 2025Updated last year
SalesforceAIResearch / LaTRO
☆127Jun 2, 2026Updated last month
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆134Jun 11, 2025Updated last year
waterhorse1 / Natural-language-RL
Natural Language Reinforcement Learning
☆101Jul 30, 2025Updated 11 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆30May 27, 2025Updated last year
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49Feb 4, 2026Updated 5 months ago
LAMDA-NeSy / Self-Backtracking
☆52Feb 12, 2025Updated last year
ShengranHu / ADAS
[ICLR 2025] Automated Design of Agentic Systems
☆1,619Jan 28, 2025Updated last year
RUC-NLPIR / Search-o1
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
☆1,240Nov 17, 2025Updated 8 months ago
facebookresearch / sweet_rl
Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"
☆271May 5, 2025Updated last year
facebookresearch / MLGym
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
☆613Aug 10, 2025Updated 11 months ago
DavidFanzz / llm_decoding
☆12Apr 25, 2025Updated last year
PRIME-RL / TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆1,103Apr 15, 2026Updated 3 months ago
ElevenLiy / MAKGED
MAKGED is the first multi-agent framework for collaborative error detection in knowledge graphs.
☆31Jul 20, 2025Updated last year
ATH-MaaS / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,537Jun 17, 2026Updated last month
SLAB-NLP / Multi-Prompt-LLM-Evaluation
State of What Art? A Call for Multi-Prompt LLM Evaluation
☆16Apr 10, 2026Updated 3 months ago
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,667Jul 2, 2026Updated 3 weeks ago
FLAIROx / cultural-accumulation
☆16Jul 16, 2024Updated 2 years ago
vivekmyers / tra-ogbench
☆18Feb 13, 2025Updated last year
LuLuLuyi / LongHeads
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆32Apr 8, 2024Updated 2 years ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆92Mar 18, 2025Updated last year
SLIT-AI / FuseChat-3.0
☆18Apr 18, 2025Updated last year
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆666Jan 29, 2026Updated 5 months ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,130May 26, 2026Updated 2 months ago
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆174Oct 20, 2025Updated 9 months ago
composable-models / llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
☆544Apr 24, 2025Updated last year
allenai / open-instruct
AllenAI's post-training codebase
☆3,810Updated this week
Linzwcs / AFT
☆13Jan 22, 2025Updated last year
snu-mllab / DiscreteBlockBayesAttack
Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…
☆26Sep 26, 2023Updated 2 years ago
agentic-learning-ai-lab / anticipatory-recovery
Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"
☆11Oct 27, 2025Updated 8 months ago