llSourcell / O1-nano
This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview
☆97Updated 8 months ago
Alternatives and similar repositories for O1-nano
Users that are interested in O1-nano are comparing it to the libraries listed below
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆88Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- LLM reads a paper and produces a working prototype☆58Updated 3 months ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆34Updated last month
- ☆86Updated 9 months ago
- Code for ScribeAgent paper☆58Updated 4 months ago
- Example implementation of Iteration of Thought - Give a star if you like the project☆42Updated 6 months ago
- ☆94Updated 3 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆71Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆115Updated 5 months ago
- ☆51Updated 3 weeks ago
- ☆78Updated 8 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆181Updated 9 months ago
- This is the official repository for Auto-RAG.☆212Updated 2 months ago
- accompanying material for sleep-time compute paper☆97Updated 2 months ago
- ☆122Updated 11 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆98Updated 8 months ago
- ☆156Updated 3 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆129Updated 4 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆96Updated last month
- ☆54Updated 5 months ago
- Official code repository for Sketch-of-Thought (SoT)☆125Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- ☆40Updated 7 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆42Updated 5 months ago
- ☆96Updated 10 months ago
- ☆179Updated 5 months ago
- ☆17Updated 5 months ago
- ☆162Updated 4 months ago