huggingface / open-r1
Fully open reproduction of DeepSeek-R1
โ23,242Updated this week
Alternatives and similar repositories for open-r1:
Users that are interested in open-r1 are comparing it to the libraries listed below
- Clean, minimal, accessible reproduction of DeepSeek R1-Zeroโ11,339Updated 2 weeks ago
- Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! ๐ฆฅโ35,893Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.โ9,282Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMsโ5,693Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)โ45,117Updated this week
- This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited dataโ3,223Updated this week
- โ87,317Updated last month
- s1: Simple test-time scalingโ6,051Updated 3 weeks ago
- DeepSeek Coder: Let the Code Write Itselfโ21,179Updated 10 months ago
- Train transformer language models with reinforcement learning.โ12,890Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsโ42,924Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsโฆโ16,515Updated last week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.โ16,356Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.โ12,427Updated this week
- Fast and memory-efficient exact attentionโ16,587Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) systemโ23,891Updated this week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizationsโ13,116Updated this week
- The official repo of Qwen (้ไนๅ้ฎ) chat & pretrained large language model proposed by Alibaba Cloud.โ17,579Updated last month
- Janus-Series: Unified Multimodal Understanding and Generation Modelsโ16,878Updated last month
- โ3,242Updated 3 weeks ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understandingโ4,628Updated last month
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovationโ6,926Updated 3 weeks ago
- ๐ค smolagents: a barebones library for agents that think in python code.โ15,909Updated this week
- Witness the aha moment of VLM with less than $3.โ3,376Updated 3 weeks ago
- FlashMLA: Efficient MLA decoding kernelsโ11,369Updated 3 weeks ago
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)โ5,919Updated this week
- โ94,439Updated last week
- ๐ฅ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.โ32,762Updated this week
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawlโ5,145Updated last month
- ๐ค PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.โ17,913Updated this week