wingAGI/clean-llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wingAGI/clean-llm)

wingAGI / clean-llm

🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据，训练Tokenizer，预训练、SFT、GRPO！

☆56

Alternatives and similar repositories for clean-llm

Users that are interested in clean-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wingAGI / cs336-assignments-answer
View on GitHub
My implementation of Stanford CS336 assignments.
☆246Mar 15, 2026Updated 4 months ago
Spectual / stanford-cs336-a1
View on GitHub
Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆77Jul 7, 2025Updated last year
mocibb / cs336
View on GitHub
☆95Jul 20, 2025Updated last year
AI45Lab / DataElf
View on GitHub
DataElf is an intelligent data workflow engine that turns natural-language tasks into secure, extensible, and executable data pipelines.
☆23Updated this week
YYZhang2025 / Stanford-CS336
View on GitHub
My Solution and Notes for the Stanford CS336: LLM from scratch
☆264Mar 23, 2026Updated 4 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
garg-aayush / building-from-scratch
View on GitHub
☆68Apr 1, 2026Updated 3 months ago
xuwenxinedu / R3
View on GitHub
☆30Apr 7, 2026Updated 3 months ago
OneRaise5385 / CS336-Notes
View on GitHub
本项目是我在学习 CS336 课程过程中整理的学习笔记 This project is a collection of study notes I compiled while taking the CS336 course.
☆25Nov 1, 2025Updated 8 months ago
qiufengqijun / open-r1-reprod
View on GitHub
这是一个open-r1的复现项目，对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练，观察到一些有趣的现象。
☆64Apr 13, 2025Updated last year
shjo-april / TRACE
View on GitHub
[ICLR 2026 Oral] TRACE: Your Diffusion Model Is Secretly an Instance Edge Detector
☆18Mar 2, 2026Updated 4 months ago
SingularGuyLeBorn / Awesome-CS336-NoteForEveryone
View on GitHub
☆143Jan 18, 2026Updated 6 months ago
MUSYohann / IFViT
View on GitHub
[IEEE TIFS under review] TOPIC: IFViT: Interpretable Fixed-Length Representation for Fingerprint Matching via Vision Transformer
☆13Apr 9, 2024Updated 2 years ago
bosichong / BabyLog
View on GitHub
岁月如风，唯有此忆, 任凭时光匆匆，记录点点滴滴。当爸爸了，就多陪陪孩子，有事没事的记些东西，不要总把心思放在程序编码上，也多陪陪孩子！记录了那么多条数据，是时候也为孩子回忆做个数据，也许将来某一天你也会翻翻看看，重温那些旧时光和家人一起感慨怀念。
☆13Mar 20, 2026Updated 4 months ago
LintaoPeng / Simple_Diffusion
View on GitHub
a simple pytorch implementation of diffusiom model
☆13Mar 20, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
yujianke100 / GFDN
View on GitHub
[KDD 2023] Group-based Fraud Detection Network on e-Commerce Platforms
☆16Feb 16, 2024Updated 2 years ago
Megum1 / UNIT
View on GitHub
[ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
☆10Dec 18, 2025Updated 7 months ago
ikhado / robsense
View on GitHub
☆15Mar 21, 2026Updated 4 months ago
GuoYiFantastic / IMelodist
View on GitHub
Music large model based on InternLM2-chat.
☆22Dec 21, 2024Updated last year
zhangfw123 / MACRec
View on GitHub
Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral)
☆44Dec 9, 2025Updated 7 months ago
AndyJZhao / WSDM23-GSR
View on GitHub
☆19Feb 28, 2023Updated 3 years ago
PRIS-CV / GRPO-for-Llava
View on GitHub
GRPO Algorithm for Llava Architecture (Based on Verl)
☆49May 9, 2025Updated last year
DezhiKong00 / Sentencepiece-chinese-bbpe
View on GitHub
使用Sentencepiece对中文语料进行分词
☆13Nov 30, 2023Updated 2 years ago
ducha-aiki / hardnet-in-fastai2-and-kornia
View on GitHub
Re-implementation of local descriptor HardNet training in fasta2+kornia
☆21Apr 6, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Longer430 / hello_pytorch
View on GitHub
深度之眼课程
☆11Aug 28, 2020Updated 5 years ago
XiaoXiao-Woo / KAMAC
View on GitHub
A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making
☆17Oct 23, 2025Updated 9 months ago
weiliang822 / ML-BigHW
View on GitHub
同济大学计科机器学习大作业
☆10Mar 22, 2025Updated last year
dashjay / mini-lsm-go
View on GitHub
translate skyzh/mini-lsm to go version
☆10Jun 7, 2023Updated 3 years ago
Guan-JW / GMM-Isolated-Speech-Recognition
View on GitHub
基于MFCC特征构建单核GMM的0-9独立词语音识别，MFCC，GMM，sklearn，Isolated word recognition。
☆10Nov 18, 2020Updated 5 years ago
Thinklab-SJTU / WSGNN
View on GitHub
Official PyTorch implementation for the following KDD2022 paper: Variational Inference for Training Graph Neural Networks in Low-Data Re…
☆20Oct 20, 2022Updated 3 years ago
SingularGuyLeBorn / Awesome-LLM-From-Scratch-Ultimate-Tutorial
View on GitHub
☆16Nov 25, 2025Updated 8 months ago
KellyGong / SparseGAD
View on GitHub
☆19Apr 29, 2023Updated 3 years ago
fieldsoftheworld / ftw-prue
View on GitHub
Official code for the paper "PRUE: A Practical Recipe for Field Boundary Segmentation at Scale"
☆24Jul 13, 2026Updated last week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cxliu0 / KL-Loss-pytorch
View on GitHub
A pytorch reimplementation of KL-Loss (CVPR'2019)
☆15Oct 15, 2023Updated 2 years ago
Xtra-Computing / PMP
View on GitHub
☆20Mar 15, 2024Updated 2 years ago
jbistanbul / universalvtg
View on GitHub
Official Code for the paper "UniversalVTG: A Univeral and Lightweight Foundation Model for Video Temporal Grounding"
☆15Apr 15, 2026Updated 3 months ago
DeclanMcIntosh / InReaCh
View on GitHub
☆14Jan 12, 2026Updated 6 months ago
gift-surg / fetal_brain_seg
View on GitHub
A toolkit for fetal brain localization and segmentation using deep learning
☆22Apr 19, 2019Updated 7 years ago
lrf23 / MeanFlow-for-Image-Restoration
View on GitHub
☆18Jun 17, 2025Updated last year
heng380 / cs336_assignment2
View on GitHub
CS33作业 2 的代码和飞书 qa, 这个作业太恶心了, 绝对是所有作业里面花的最久的
☆24Jul 17, 2025Updated last year