cloudygoose / MiniAgentsLinks
The MiniAgents visualization tool for simulacra.
☆17Updated last year
Alternatives and similar repositories for MiniAgents
Users that are interested in MiniAgents are comparing it to the libraries listed below
Sorting:
- ☆27Updated 2 years ago
- ☆179Updated 5 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆83Updated 8 months ago
- Resources for the Enigmata Project.☆72Updated 2 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆61Updated 11 months ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆40Updated 2 months ago
- [ICML 2025] Official Implementation of GLIDER☆66Updated last month
- Description for MV-MATH☆15Updated 3 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated 11 months ago
- Safety-J: Evaluating Safety with Critique☆16Updated last year
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆119Updated last week
- ☆52Updated 5 months ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆46Updated last month
- ☆13Updated last year
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆124Updated 2 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆68Updated 4 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆60Updated 7 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆188Updated last week
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆51Updated 2 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆47Updated 8 months ago
- [EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mech…☆16Updated last year
- my commonly-used tools☆63Updated 10 months ago
- ☆40Updated last week
- FeatureAlignment = Alignment + Mechanistic Interpretability☆31Updated 8 months ago
- ☆29Updated 7 months ago
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆124Updated 4 months ago
- ☆213Updated 7 months ago
- Official Repository of LatentSeek☆66Updated 5 months ago
- A comprehensive collection of process reward models.☆116Updated last month