IntologyAI / ZochiLinks
Repository for Zochi's Research
☆238Updated last week
Alternatives and similar repositories for Zochi
Users that are interested in Zochi are comparing it to the libraries listed below
Sorting:
- ☆334Updated 3 weeks ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆164Updated this week
- ☆210Updated 4 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆319Updated last week
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆150Updated last month
- ☆142Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆226Updated 2 months ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆323Updated this week
- ☆230Updated last month
- A lightweight framework for building research agents designed for developers☆101Updated this week
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆209Updated this week
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement☆114Updated 5 months ago
- ☆210Updated 2 weeks ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆218Updated last week
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆77Updated 2 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆244Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆295Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆130Updated this week
- Tina: Tiny Reasoning Models via LoRA☆268Updated last month
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆385Updated last week
- Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆184Updated this week
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆216Updated 3 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆106Updated 4 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆228Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆185Updated 3 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆75Updated last month
- CodeScientist: An automated scientific discovery system for code-based experiments☆275Updated 3 weeks ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆267Updated 4 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆168Updated this week