changyeyu/LLM-RL-Visualized

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/changyeyu/LLM-RL-Visualized)

changyeyu / LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

☆4,721

Alternatives and similar repositories for LLM-RL-Visualized

Users that are interested in LLM-RL-Visualized are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

datawhalechina / happy-llm
View on GitHub
📚 从零开始构建大模型
☆32,473May 6, 2026Updated 2 months ago
jingyaogong / minimind
View on GitHub
🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!
☆54,029Jul 23, 2026Updated last week
datawhalechina / self-llm
View on GitHub
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程
☆31,479Jul 15, 2026Updated 2 weeks ago
liguodongiot / llm-action
View on GitHub
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
☆24,819Jul 19, 2026Updated last week
ZJU-LLMs / Foundations-of-LLMs
View on GitHub
A book for Learning the Foundations of LLMs
☆16,514Dec 12, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Anionex / banana-slides
View on GitHub
一个基于nano banana pro🍌的原生AI PPT生成应用，迈向＂Vibe PPT＂; 支持上传任意模板图片，上传任意素材&智能解析，一句话/大纲/页面描述自动生成PPT，口头修改指定区域、一键导出可编辑ppt - An AI-native slides gene…
☆15,340Updated this week
wdndev / llm_interview_note
View on GitHub
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
☆14,787Jun 14, 2026Updated last month
WangRongsheng / awesome-LLM-resources
View on GitHub
🧑‍🚀 全世界最好的LLM资料总结（多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.
☆8,769Updated this week
shareAI-lab / learn-claude-code
View on GitHub
Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
☆72,631Updated this week
no-magic-ai / no-magic
View on GitHub
Because `model.fit()` isn't an explanation
☆1,390Apr 26, 2026Updated 3 months ago
jingyaogong / minimind-v
View on GitHub
👀「大模型」2小时从0训练65M参数的视觉多模态VLM！Train a 65M-parameter VLM from scratch in just 2h!
☆8,396Jun 28, 2026Updated last month
datawhalechina / hello-agents
View on GitHub
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
☆69,416Updated this week
hiyouga / LlamaFactory
View on GitHub
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆73,606Updated this week
datawhalechina / llm-universe
View on GitHub
本项目是一个面向小白开发者的大模型应用开发教程，在线阅读地址：https://datawhalechina.github.io/llm-universe/
☆13,669Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
adongwanai / AgentGuide
View on GitHub
https://adongwanai.github.io/AgentGuide | AI Agent开发指南 | LangGraph实战 | 高级RAG | 转行大模型 | 大模型面试 | 算法工程师 | 面试题库 | 强化学习｜数据合成
☆7,432Updated this week
luhengshiwo / LLMForEverybody
View on GitHub
每个人都能看懂的大模型知识分享，LLMs春/秋招大模型面试前必看，让你和面试官侃侃而谈
☆7,042May 31, 2026Updated last month
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,711Updated this week
SwanHubX / SwanLab
View on GitHub
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …
☆4,098Updated this week
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆14,679Apr 26, 2026Updated 3 months ago
MaximeVandegar / Papers-in-100-Lines-of-Code
View on GitHub
Implementation of papers in 100 lines of code.
☆2,843Apr 8, 2026Updated 3 months ago
xlite-dev / LeetCUDA
View on GitHub
LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.
☆11,662Updated this week
modelscope / ms-swift
View on GitHub
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,995Updated this week
LLMBook-zh / LLMBook-zh.github.io
View on GitHub
《大语言模型》作者：赵鑫，李军毅，周昆，唐天一，文继荣
☆4,534Sep 2, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Lordog / dive-into-llms
View on GitHub
《动手学大模型Dive into LLMs》系列编程实践教程
☆46,327Oct 10, 2025Updated 9 months ago
datawhalechina / tiny-universe
View on GitHub
《大模型白盒子构建指南》：一个全手搓的Tiny-Universe
☆4,987Feb 12, 2026Updated 5 months ago
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,752Feb 27, 2026Updated 5 months ago
MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning
View on GitHub
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
☆17,285Jul 21, 2026Updated last week
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,855Jul 14, 2026Updated 2 weeks ago
rasbt / LLMs-from-scratch
View on GitHub
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆99,963Updated this week
Infrasys-AI / AIInfra
View on GitHub
AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。
☆7,758Dec 22, 2025Updated 7 months ago
aceliuchanghong / FAQ_Of_LLM_Interview
View on GitHub
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
☆1,973Jul 20, 2026Updated last week
AiHubCN / Awesome-Chinese-LLM
View on GitHub
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
☆22,708May 10, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
datawhalechina / so-large-lm
View on GitHub
大模型基础: 一文了解大模型基础知识
☆7,522Jun 22, 2026Updated last month
DayuanJiang / next-ai-draw-io
View on GitHub
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagr…
☆33,945Updated this week
wyf3 / llm_related
View on GitHub
复现大模型相关算法及一些学习记录
☆3,471Jul 2, 2026Updated 3 weeks ago
linshenkx / prompt-optimizer
View on GitHub
An AI prompt optimizer for writing better prompts and getting better AI results.
☆32,746Updated this week
dw-dengwei / daily-arXiv-ai-enhanced
View on GitHub
Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.
☆2,910Updated this week
521xueweihan / HelloGitHub
View on GitHub
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
☆168,002Updated this week
RICHQAQ / PasteMD
View on GitHub
一键将 Markdown 和网页 AI 对话（ChatGPT/DeepSeek等）完美粘贴到 Word、WPS 和 Excel 的效率工具 | One-click paste Markdown and AI responses (ChatGPT/DeepSeek) into…
☆5,167Jul 22, 2026Updated last week