richardblythman / awesome-multi-agent-systemsLinks
A curated list of awesome resources, libraries, frameworks, and tools for multi-agent systems (MAS) research and development.
☆19Updated 10 months ago
Alternatives and similar repositories for awesome-multi-agent-systems
Users that are interested in awesome-multi-agent-systems are comparing it to the libraries listed below
Sorting:
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆454Updated last year
- Naptha is a framework and infrastructure for developing and running multi-agent systems at scale with heterogeneous models, architectures…☆179Updated 9 months ago
- A simulated operating system design for AI Agents to interact with the world☆177Updated last year
- ☆175Updated 10 months ago
- Inference-time scaling for LLMs-as-a-judge.☆324Updated 2 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆315Updated 6 months ago
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆127Updated 3 months ago
- A curated list of awesome approaches to AI model routing☆180Updated 9 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆235Updated 10 months ago
- Agent framework for generating a synthetic dataset. This will be raw CoT and Reflection output to be cleaned up by a later step.☆15Updated 9 months ago
- ☆240Updated last month
- ☆213Updated 2 weeks ago
- A dynamic forecasting benchmark for LLMs☆49Updated this week
- A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o,…☆113Updated 8 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆264Updated this week
- summaries of ai research☆53Updated 8 months ago
- Claude Deep Research config for Claude Code.☆224Updated 9 months ago
- ☆618Updated 4 months ago
- ⚖️ Awesome LLM Judges ⚖️☆148Updated 8 months ago
- Task-based Agentic Framework using StrictJSON as the core☆462Updated last month
- Harbor is a framework for running agent evaluations and creating and using RL environments.☆306Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information.☆313Updated last year
- ☆32Updated 7 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆236Updated 7 months ago
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Morpheus Local Agents☆51Updated 2 weeks ago
- ☆135Updated 9 months ago
- Letting Claude Code develop his own MCP tools :)☆122Updated 10 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆339Updated this week
- AWM: Agent Workflow Memory☆376Updated 3 weeks ago