fannie1208 / W4S
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors
☆26 Updated 2 months ago
Alternatives and similar repositories for W4S
Users who are interested in W4S are comparing it to the libraries listed below.
- A trainable user simulator ☆34 Updated 2 weeks ago
- A Comprehensive Library for Memory of LLM-based Agents. ☆52 Updated 2 months ago
- exploring whether LLMs perform case-based or rule-based reasoning ☆29 Updated last year
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows" ☆96 Updated 2 weeks ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ☆26 Updated last year
- ☆41 Updated 8 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆78 Updated last year
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions. ☆41 Updated last month
- ☆12 Updated 7 months ago
- ☆47 Updated last month
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆85 Updated 6 months ago
- ☆22 Updated last year
- ☆50 Updated last month
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA… ☆24 Updated last year
- ☆57 Updated 3 weeks ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆77 Updated 8 months ago
- This is the code of MMOA-RAG. ☆60 Updated 2 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla… ☆45 Updated 2 months ago
- ☆45 Updated 8 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆103 Updated 2 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble" ☆95 Updated last week
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain" ☆67 Updated 7 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting" ☆66 Updated last year
- ☆46 Updated 7 months ago
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra… ☆50 Updated this week
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling ☆144 Updated last week
- RL Scaling and Test-Time Scaling (ICML'25) ☆109 Updated 5 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science ☆28 Updated 8 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet ☆130 Updated last month
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆38 Updated this week