BKHMSI / mixture-of-cognitive-reasonersLinks
☆24Updated 3 weeks ago
Alternatives and similar repositories for mixture-of-cognitive-reasoners
Users that are interested in mixture-of-cognitive-reasoners are comparing it to the libraries listed below
Sorting:
- ☆23Updated 3 weeks ago
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆28Updated 3 weeks ago
- Resa: Transparent Reasoning Models via SAEs☆39Updated last month
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆19Updated 4 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- ☆19Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆71Updated 3 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆46Updated 2 months ago
- ☆20Updated 3 months ago
- ☆67Updated 2 weeks ago
- Esoteric Language Models☆87Updated 2 weeks ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆24Updated 3 weeks ago
- A simple script to see how my ideas evolve over time☆41Updated last month
- ☆13Updated 3 weeks ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆94Updated 2 months ago
- Official Code Release for "Training a Generally Curious Agent"☆26Updated last month
- Official Repository for Task-Circuit Quantization☆20Updated last month
- ☆36Updated last month
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 2 months ago
- Fork of Flame repo for training of some new stuff in development☆14Updated 3 weeks ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆27Updated 2 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆91Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 3 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 4 months ago
- Official repo of paper LM2☆41Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- ☆66Updated 3 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆55Updated last month