llSourcell / O1-nanoLinks
This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview
☆97Updated 7 months ago
Alternatives and similar repositories for O1-nano
Users that are interested in O1-nano are comparing it to the libraries listed below
Sorting:
- Simple examples using Argilla tools to build AI☆53Updated 6 months ago
- Code for ScribeAgent paper☆57Updated 3 months ago
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆87Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 4 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆32Updated 2 weeks ago
- LLM reads a paper and produce a working prototype☆57Updated last month
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆113Updated 3 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆120Updated 3 months ago
- ☆50Updated this week
- ☆92Updated 2 months ago
- ☆67Updated 3 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆97Updated 7 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆53Updated 2 months ago
- ☆54Updated 3 months ago
- ☆80Updated 2 weeks ago
- ☆86Updated 8 months ago
- ☆86Updated 2 weeks ago
- LLM that can analyze stocks☆23Updated 6 months ago
- ☆89Updated last week
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 5 months ago
- ☆57Updated last week
- ☆56Updated 5 months ago
- ☆92Updated 3 weeks ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆162Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆67Updated 2 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆140Updated 2 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆39Updated 4 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago