hijkzzz / Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
☆5,211Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Awesome-LLM-Strawberry
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆3,906Updated last month
- O1 Replication Journey: A Strategic Progress Report – Part I☆1,318Updated 3 weeks ago
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)☆2,666Updated this week
- Tools for merging pretrained large language models.☆4,816Updated 2 weeks ago
- PyTorch native finetuning library☆4,336Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning.☆2,010Updated 3 weeks ago
- A reading list on LLM based Synthetic Data Generation 🔥☆791Updated 2 weeks ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆2,281Updated last month
- Robust recipes to align language models with human and AI preferences☆4,680Updated last month
- Composable building blocks to build Llama Apps☆4,594Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,227Updated last week
- SGLang is a fast serving framework for large language models and vision language models.☆6,127Updated this week
- ☆819Updated last month
- ☆935Updated 2 weeks ago
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆6,852Updated 3 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,045Updated this week
- ☆2,746Updated 2 months ago
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆3,977Updated last week
- ☆2,595Updated last week
- [TMLR] A curated list of language modeling researches for code and related datasets.☆1,697Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆3,597Updated last month
- DataComp for Language Models☆1,157Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆4,669Updated this week
- Must-read Papers on LLM Agents.☆1,864Updated last week
- Video+code lecture on building nanoGPT from scratch☆3,611Updated 3 months ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆4,141Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆3,505Updated last month
- A native PyTorch Library for large model training☆2,623Updated this week
- ☆2,898Updated last month
- Large Reasoning Models☆580Updated this week