IntologyAI / ZochiLinks
Repository for Zochi's Research
☆300Updated 2 months ago
Alternatives and similar repositories for Zochi
Users that are interested in Zochi are comparing it to the libraries listed below
Sorting:
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆192Updated this week
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆405Updated 2 months ago
- [ICLR 2026] Learning to Reason without External Rewards☆389Updated 2 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆169Updated 3 months ago
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆217Updated 3 months ago
- ☆229Updated 11 months ago
- ☆371Updated 6 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆177Updated 3 months ago
- ☆275Updated 5 months ago
- ☆849Updated 3 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261Updated 9 months ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆56Updated 7 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆126Updated last week
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆427Updated 2 weeks ago
- A banchmark list for evaluation of large language models.☆159Updated 3 weeks ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆239Updated 2 months ago
- ☆388Updated 3 months ago
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.☆308Updated 3 months ago
- ☆331Updated 6 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆283Updated 4 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆273Updated 3 months ago
- ☆352Updated 6 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆298Updated last week
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)☆519Updated 4 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆534Updated 5 months ago
- ☆211Updated 6 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆180Updated 7 months ago
- Open-source Agentic RL for LLMs — RLAnything & DemyAgent☆223Updated last week
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆576Updated this week
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆307Updated 2 weeks ago