cloudygoose / MiniAgents
The MiniAgents visualization tool for simulacra.
☆17Updated last year
Alternatives and similar repositories for MiniAgents
Users who are interested in MiniAgents are comparing it to the repositories listed below.
- ☆27Updated 2 years ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆73Updated 5 months ago
- Resources for the Enigmata Project.☆73Updated 3 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆194Updated 2 weeks ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆84Updated 9 months ago
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆52Updated 3 months ago
- [ICML 2025] Official Implementation of GLIDER☆67Updated last month
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆205Updated last month
- ☆52Updated 6 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆63Updated 11 months ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆45Updated 7 months ago
- ☆184Updated 6 months ago
- Official Repository of LatentSeek☆69Updated 5 months ago
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…☆69Updated 4 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated last year
- The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static t…☆47Updated 2 months ago
- A comprehensive collection of process reward models.☆122Updated last month
- Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.☆49Updated 2 weeks ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆125Updated last month
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆15Updated last week
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆48Updated 5 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆31Updated 8 months ago
- ☆29Updated 7 months ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆45Updated 3 months ago
- Description for MV-MATH☆15Updated 4 months ago
- 【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185☆22Updated 6 months ago
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"☆18Updated 8 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆46Updated 9 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆69Updated 4 months ago
- [EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mech…☆16Updated last year