hijkzzz / Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 π and reasoning techniques.
β6,726Updated this week
Alternatives and similar repositories for Awesome-LLM-Strawberry
Users that are interested in Awesome-LLM-Strawberry are comparing it to the libraries listed below
Sorting:
- verl: Volcano Engine Reinforcement Learning for LLMsβ7,873Updated this week
- Simple RL training for reasoningβ3,560Updated last month
- O1 Replication Journeyβ1,989Updated 4 months ago
- Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 πβ3,071Updated last week
- Democratizing Reinforcement Learning for LLMsβ3,236Updated this week
- An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)β6,661Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ2,253Updated this week
- Official Repo for Open-Reasoner-Zeroβ1,916Updated last month
- Scalable RL solution for advanced reasoning of language modelsβ1,552Updated last month
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Modelsβ1,767Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β1,813Updated this week
- Curated list of datasets and tools for post-training.β3,044Updated 3 months ago
- A library for advanced large language model reasoningβ2,122Updated last month
- Witness the aha moment of VLM with less than $3.β3,658Updated 2 months ago
- PyTorch native post-training libraryβ5,171Updated last week
- Reproduce R1 Zero on Logic Puzzleβ2,337Updated last month
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chainsβ4,213Updated 3 months ago
- Minimal reproduction of DeepSeek R1-Zeroβ11,753Updated 3 weeks ago
- π° Must-read papers and blogs on LLM based Long Context Modeling π₯β1,477Updated last week
- Robust recipes to align language models with human and AI preferencesβ5,173Updated 2 weeks ago
- β1,356Updated 5 months ago
- AllenAI's post-training codebaseβ2,950Updated this week
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...β1,992Updated 2 weeks ago
- β2,798Updated last week
- Awesome Reasoning LLM Tutorial/Survey/Guideβ1,605Updated last month
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRLβ2,355Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,700Updated this week
- β4,079Updated 11 months ago
- Fully open data curation for reasoning modelsβ1,772Updated last week
- A reading list on LLM based Synthetic Data Generation π₯β1,265Updated 2 months ago