Just-A-Pie / MiniAgentStudioLinks
☆21Updated 3 weeks ago
Alternatives and similar repositories for MiniAgentStudio
Users that are interested in MiniAgentStudio are comparing it to the libraries listed below
Sorting:
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆244Updated 3 weeks ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆87Updated 4 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆134Updated last month
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆18Updated last month
- ☆11Updated last year
- The awesome agents in the era of large language models☆68Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆321Updated last year
- ☆43Updated 5 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆154Updated this week
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆103Updated 3 weeks ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆21Updated last week
- A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical …☆34Updated this week
- ☆15Updated last year
- ☆51Updated last year
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆69Updated 5 months ago
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆155Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆224Updated 2 weeks ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆427Updated 7 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆59Updated last year
- Awesome RL Reasoning Recipes ("Triple R")☆804Updated 3 weeks ago
- ☆31Updated 3 months ago
- Large Language Models(LLMs) of Code☆18Updated 2 years ago
- ☆21Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆288Updated last month
- Accepted by ACL 2025☆31Updated 3 weeks ago
- A series of technical report on Slow Thinking with LLM☆726Updated 3 weeks ago
- ☆24Updated last year
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆18Updated last year
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆591Updated last week
- ☆51Updated 2 months ago