cloudygoose / MiniAgentsLinks
The MiniAgents visualization tool for simulacra.
β17Updated last year
Alternatives and similar repositories for MiniAgents
Users that are interested in MiniAgents are comparing it to the libraries listed below
Sorting:
- π₯π₯π₯Latest Papers, Codes on Uncertainty-based RLβ50Updated last month
- MAT: Multi-modal Agent Tuning π₯ ICLR 2025 (Spotlight)β65Updated 4 months ago
- β27Updated 2 years ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chainsβ57Updated 2 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGIβ186Updated this week
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"β34Updated last month
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoningβ88Updated this week
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorialsβ42Updated 7 months ago
- Resources for the Enigmata Project.β71Updated 2 months ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agentsβ192Updated 5 months ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static tβ¦β45Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.β170Updated last week
- β171Updated 5 months ago
- π This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.β237Updated 3 weeks ago
- β21Updated 5 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β81Updated 8 months ago
- β108Updated last month
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabβ¦β117Updated 2 months ago
- β82Updated last year
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". β¦β13Updated 2 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Modelsβ59Updated 10 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models scaβ¦β31Updated last week
- γICLR 2025 π₯γThe code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overcoβ¦β45Updated 6 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluationβ58Updated 6 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β56Updated 10 months ago
- [ICLR 2025] ChartMimic: Evaluating LMMβs Cross-Modal Reasoning Capability via Chart-to-Code Generationβ124Updated 4 months ago
- Official Repository of LatentSeekβ64Updated 4 months ago
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondβ306Updated 2 weeks ago
- A comprehensive collection of process reward models.β111Updated 2 weeks ago
- β13Updated last year