llSourcell / O1-nanoLinks
This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview
☆97Updated last year
Alternatives and similar repositories for O1-nano
Users that are interested in O1-nano are comparing it to the libraries listed below
Sorting:
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆90Updated 10 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- LLM reads a paper and produce a working prototype☆60Updated 8 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆124Updated 10 months ago
- ☆126Updated last year
- Simple examples using Argilla tools to build AI☆56Updated last year
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆187Updated last year
- ☆86Updated last year
- Beating the GAIA benchmark with Transformers Agents. 🚀☆139Updated 9 months ago
- ☆84Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 11 months ago
- Code for ScribeAgent paper☆63Updated 9 months ago
- ☆17Updated 10 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Updated 8 months ago
- ☆62Updated last year
- ☆55Updated 3 months ago
- ☆102Updated last year
- ☆57Updated 10 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 4 months ago
- ☆97Updated last week
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- Training setup for Langchain's Open Deep Research☆72Updated 3 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 11 months ago
- ☆86Updated last year
- Finetune Llama-3-8b on the MathInstruct dataset☆115Updated last year
- Automating enterprise workflows with multimodal agents☆113Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 8 months ago
- An automated tool for discovering insights from research papaer corpora☆137Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago