Just-A-Pie / MiniAgentStudioLinks
☆25Updated 3 months ago
Alternatives and similar repositories for MiniAgentStudio
Users that are interested in MiniAgentStudio are comparing it to the libraries listed below
Sorting:
- Awesome papers for role-playing with language models☆218Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆172Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆415Updated 3 months ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆510Updated last month
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆175Updated 2 years ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆258Updated 5 months ago
- The awesome agents in the era of large language models☆71Updated 2 years ago
- Membenchmark repository☆46Updated 2 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 9 months ago
- ☆56Updated last year
- ☆41Updated 2 months ago
- Source code and demo for memory bank and SiliconFriend☆402Updated 2 years ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆418Updated 2 months ago
- [COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale☆107Updated 2 months ago
- A version of verl to support diverse tool use☆860Updated last month
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆95Updated 2 weeks ago
- Latest Advances on Long Chain-of-Thought Reasoning☆607Updated 6 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆276Updated last week
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆155Updated 5 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆154Updated 3 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Updated last year
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆98Updated 6 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆249Updated 9 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆278Updated 2 weeks ago
- Building a comprehensive and handy list of papers for GUI agents☆628Updated 3 months ago
- ☆224Updated 10 months ago
- ☆186Updated 3 weeks ago
- ☆239Updated last month
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Updated 10 months ago
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆26Updated last year