BKHMSI / mixture-of-cognitive-reasoners
Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
☆33Updated 2 weeks ago
Alternatives and similar repositories for mixture-of-cognitive-reasoners
Users that are interested in mixture-of-cognitive-reasoners are comparing it to the libraries listed below
- ☆27Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆79Updated 7 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 6 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆161Updated 2 months ago
- ☆19Updated 7 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 2 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆32Updated 2 months ago
- ☆17Updated 4 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆32Updated 3 weeks ago
- Improving AI Systems with Self-Defense Mechanisms☆20Updated 8 months ago
- ☆44Updated last month
- ☆40Updated 5 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆37Updated last month
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆99Updated this week
- Resa: Transparent Reasoning Models via SAEs☆44Updated last month
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆71Updated 3 months ago
- Code for ExploreTom☆86Updated 4 months ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆53Updated 2 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆42Updated 6 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆109Updated 3 weeks ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆53Updated 2 weeks ago
- ☆58Updated 4 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated this week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 6 months ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆50Updated 6 months ago
- ☆67Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Demystifying Reinforcement Learning in Agentic Reasoning☆104Updated 2 weeks ago
- ☆18Updated 3 months ago