zhejianglab / Data-Processing-Toolkit-for-LLMsLinks
☆24Updated last year
Alternatives and similar repositories for Data-Processing-Toolkit-for-LLMs
Users that are interested in Data-Processing-Toolkit-for-LLMs are comparing it to the libraries listed below
Sorting:
- Reproduce R1 Zero on Logic Puzzle☆2,432Updated 10 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,572Updated 2 weeks ago
- 该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题☆2,425Updated last year
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,300Updated 2 years ago
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆737Updated 11 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆8,949Updated last week
- ☆1,293Updated last month
- Train a 1B LLM with 1T tokens from scratch by personal☆788Updated 9 months ago
- 利用HuggingFace的官方下载工具从镜像网站进行高速下载。☆1,299Updated last year
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆618Updated 2 years ago
- 复现大模型相关算法及一些学习记录☆2,942Updated last week
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆928Updated 2 months ago
- BBPE 底层实现☆38Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆6,639Updated 3 weeks ago
- 从零实现一个小参数量中文大语言模型。☆940Updated last year
- Huggingface transformers的中文文档☆293Updated 2 years ago
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,384Updated this week
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,509Updated this week
- 通义千问VLLM推理部署DEMO☆638Updated last year
- 用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.☆2,889Updated last year
- Qwen3 Fine-tuning: Medical R1 Style Chat☆277Updated 8 months ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (…☆12,594Updated this week
- LLM&VLM Tutorial☆1,933Updated 9 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆19,132Updated this week
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,409Updated 2 years ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆495Updated 9 months ago
- Community maintained hardware plugin for vLLM on Ascend☆1,651Updated this week
- DeepSeek 系列工作解读、扩展和复现。☆700Updated 10 months ago
- 大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调☆600Updated 8 months ago
- Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷☆5,840Updated this week