IntologyAI / Zochi
Repository for Zochi's Research
☆60 Updated last month
Alternatives and similar repositories for Zochi
Users who are interested in Zochi are comparing it to the repositories listed below.
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆85 Updated last month
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆90 Updated 2 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆146 Updated 2 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆58 Updated last month
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆193 Updated last week
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf) ☆54 Updated 9 months ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆16 Updated last month
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆52 Updated 2 months ago
- ☆27 Updated this week
- ☆114 Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆146 Updated 3 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling ☆101 Updated 3 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆105 Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆132 Updated last month
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI ☆108 Updated last week
- ☆288 Updated 10 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans. ☆76 Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆92 Updated 2 months ago
- [ICML 2025] A platform for developers to simulate collaborative research activities ☆154 Updated this week
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents" ☆73 Updated last month
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆156 Updated this week
- Agentic Knowledgeable Self-awareness ☆56 Updated last month
- ☆97 Updated 10 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆222 Updated 6 months ago
- ☆142 Updated last year
- Official Implementation of "Reasoning Language Models: A Blueprint" ☆60 Updated 3 months ago
- A benchmark that challenges language models to code solutions for scientific problems ☆119 Updated this week
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" ☆146 Updated 3 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆97 Updated 6 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆85 Updated 2 weeks ago