cloudygoose / MiniAgentsLinks
The MiniAgents visualization tool for simulacra.
☆17Updated last year
Alternatives and similar repositories for MiniAgents
Users that are interested in MiniAgents are comparing it to the libraries listed below
Sorting:
- ☆204Updated last month
- ☆27Updated 2 years ago
- Official Repository of LatentSeek☆76Updated 8 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆63Updated last year
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆50Updated 11 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆88Updated 11 months ago
- [ICML 2025] Official Implementation of GLIDER☆72Updated 4 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Updated 10 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆276Updated last week
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Updated last year
- Resources for the Enigmata Project.☆77Updated 5 months ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆68Updated last month
- Safety-J: Evaluating Safety with Critique☆16Updated last year
- ☆352Updated 6 months ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆48Updated 10 months ago
- A comprehensive collection of process reward models.☆136Updated 4 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆160Updated 3 months ago
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆54Updated 7 months ago
- Automated bibliography verification and LaTeX quality auditing for papers.☆78Updated 2 weeks ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆76Updated 6 months ago
- ☆38Updated last year
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆201Updated 11 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Updated last year
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆59Updated 5 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆250Updated 3 months ago
- my commonly-used tools☆64Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability☆34Updated 11 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆352Updated 3 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆344Updated 2 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆419Updated 6 months ago