IntologyAI / Zochi
Repository for Zochi's Research
☆267Updated 3 weeks ago
Alternatives and similar repositories for Zochi
Users who are interested in Zochi are comparing it to the repositories listed below.
- ☆472Updated 2 weeks ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆122Updated 3 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆160Updated 3 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆173Updated this week
- ☆350Updated last month
- Code for the paper: "Learning to Reason without External Rewards"☆353Updated 2 months ago
- (ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆161Updated this week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆406Updated last week
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆261Updated 4 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆244Updated 4 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆389Updated last month
- ☆274Updated last month
- ☆205Updated last month
- ☆214Updated 6 months ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆47Updated 2 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆357Updated this week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆258Updated 3 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆126Updated 7 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆81Updated 3 months ago
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models☆230Updated last month
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.☆261Updated 3 weeks ago
- The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"☆168Updated last month
- ☆178Updated last month
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆105Updated 3 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆97Updated 5 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆178Updated 3 months ago
- [ACL 2025] Multi-Agent System for Science of Science☆57Updated last month
- CycleResearcher: Improving Automated Research via Automated Review☆240Updated 2 months ago
- A benchmark list for evaluation of large language models.☆141Updated last week
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆271Updated 7 months ago