charlesjin / emergent-semanticsLinks
☆41Updated 11 months ago
Alternatives and similar repositories for emergent-semantics
Users that are interested in emergent-semantics are comparing it to the libraries listed below
Sorting:
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 2 months ago
- ☆65Updated 2 months ago
- ☆56Updated 6 months ago
- ☆24Updated 9 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆53Updated 3 weeks ago
- ☆20Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆72Updated last week
- ☆21Updated last month
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆57Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Code repo for MathAgent☆16Updated last year
- Collection of LLM completions for reasoning-gym task datasets☆24Updated last month
- ☆41Updated 6 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆18Updated last month
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆42Updated this week
- LLM reads a paper and produce a working prototype☆57Updated 2 months ago
- ☆11Updated 11 months ago
- A multimodal agent that can interact with its own PC in a multimodal manner.☆26Updated this week
- Official Code Release for "Training a Generally Curious Agent"☆25Updated last month
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 7 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 6 months ago
- ☆35Updated 3 weeks ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆17Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆90Updated last month
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆39Updated 7 months ago
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging☆73Updated 3 weeks ago