llSourcell / O1-nano
This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview
☆99Updated 5 months ago
Alternatives and similar repositories for O1-nano:
Users who are interested in O1-nano are comparing it to the libraries listed below:
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆86Updated 2 months ago
- LLM reads a paper and produces a working prototype☆51Updated 2 weeks ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆75Updated 3 weeks ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆107Updated last month
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- Example implementation of Iteration of Thought - Give a star if you like the project☆39Updated 3 months ago
- Code for ScribeAgent paper☆54Updated 3 weeks ago
- ☆50Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆52Updated last week
- Beating the GAIA benchmark with Transformers Agents. 🚀☆103Updated last month
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆83Updated last week
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 8 months ago
- ☆117Updated 7 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆48Updated last month
- ☆84Updated 6 months ago
- ☆185Updated last month
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆31Updated last month
- ☆151Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆74Updated last week
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆112Updated last week
- Open Agent Computer Interface☆59Updated 4 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆29Updated last month
- Research assistant for performing online research on a given topic, using LlamaIndex Workflows and the Tavily API. Inspired by GPT-Researcher☆160Updated 6 months ago
- RAG example using DSPy, Gradio, FastAPI☆75Updated 11 months ago
- ☆61Updated last month
- Deep Research through Multi-Agents, using GraphRAG☆62Updated 4 months ago
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- Simple Graph Memory for AI applications☆84Updated 8 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆92Updated 5 months ago