cloudygoose / MiniAgentsLinks
The MiniAgents visualization tool for simulacra.
β17Updated last year
Alternatives and similar repositories for MiniAgents
Users that are interested in MiniAgents are comparing it to the libraries listed below
Sorting:
- β27Updated 2 years ago
- MAT: Multi-modal Agent Tuning π₯ ICLR 2025 (Spotlight)β77Updated 6 months ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"β51Updated this week
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β85Updated 10 months ago
- [ICML 2025] Official Implementation of GLIDERβ71Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β56Updated last year
- Official Repository of LatentSeekβ70Updated 6 months ago
- Resources for the Enigmata Project.β73Updated 4 months ago
- β189Updated 7 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGIβ220Updated 2 months ago
- β27Updated 8 months ago
- β190Updated last month
- [2025-TMLR] A Survey on the Honesty of Large Language Modelsβ63Updated last year
- γNeurIPS 2024γThe implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185β22Updated 6 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.β235Updated last week
- A comprehensive collection of process reward models.β127Updated 2 months ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static tβ¦β47Updated 3 months ago
- β52Updated 6 months ago
- β73Updated last month
- β346Updated 4 months ago
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondβ319Updated 2 months ago
- π₯π₯π₯Latest Papers, Codes on Uncertainty-based RLβ56Updated 3 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorialsβ45Updated 9 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environmentsβ90Updated 7 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ61Updated 6 months ago
- Extrapolating RLVR to General Domains without Verifiersβ184Updated 4 months ago
- OS-Sentinelβ37Updated last month
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmaβ¦β68Updated 5 months ago
- β13Updated last year
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β119Updated last year