charlesjin / emergent-semanticsLinks
☆41Updated last year
Alternatives and similar repositories for emergent-semantics
Users that are interested in emergent-semantics are comparing it to the libraries listed below
Sorting:
- ☆67Updated 8 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆35Updated 2 months ago
- ☆19Updated 9 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆62Updated 8 months ago
- ☆63Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆59Updated this week
- The original Shared Recurrent Memory Transformer implementation☆33Updated 5 months ago
- ☆40Updated last year
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 4 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- ☆11Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 6 months ago
- ☆24Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Updated 2 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 6 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 6 months ago
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- ☆21Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 3 months ago
- LIMI: Less is More for Agency☆154Updated 2 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Updated 8 months ago
- CursorCore: Assist Programming through Aligning Anything☆133Updated 10 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 7 months ago
- The Library for LLM-based multi-agent applications☆91Updated 5 months ago
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆66Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago