🏆🏆 "Large models" All in one & All from scratch. 🌍🌍 Collect and clean data, train a tokenizer, then pretraining, SFT, and GRPO!
☆57Aug 12, 2025Updated 10 months ago
Alternatives and similar repositories for clean-llm
Users who are interested in clean-llm are comparing it to the libraries listed below.
- My implementation of Stanford CS336 assignments.☆239Mar 15, 2026Updated 3 months ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆29Jul 3, 2025Updated 11 months ago
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆25Jan 24, 2026Updated 4 months ago
- TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation☆124May 30, 2026Updated 2 weeks ago
- ☆56May 13, 2026Updated last month
- ☆97Jul 20, 2025Updated 10 months ago
- ☆26Apr 27, 2025Updated last year
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆19Feb 4, 2026Updated 4 months ago
- derived from https://github.com/wilfredinni/python-cheatsheet☆10Nov 8, 2023Updated 2 years ago
- [NeurIPS, 2020 - Reproducibility Challenge]: [RE] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents☆13Apr 26, 2021Updated 5 years ago
- Project 1 of Udacity's Deep Reinforcement Learning nanodegree program☆13Dec 2, 2018Updated 7 years ago
- ☆28Jan 17, 2026Updated 4 months ago
- ☆15Apr 23, 2026Updated last month
- An autonomous vehicle learns how to navigate efficiently at a crossroad☆16Jan 31, 2018Updated 8 years ago
- xv6 riscv completing all the MIT optional challenges☆16Mar 5, 2024Updated 2 years ago
- hustpa ics2019☆10Jul 11, 2022Updated 3 years ago
- ☆14Mar 29, 2026Updated 2 months ago
- A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.☆106Mar 24, 2026Updated 2 months ago
- CMU 15-440/640 Distributed Systems☆15Oct 2, 2014Updated 11 years ago
- Split Learning Simulation Framework for LLMs☆42Sep 10, 2024Updated last year
- mit 6.5840: Distributed Systems Spring 2023☆16Apr 10, 2023Updated 3 years ago
- ☆22Jul 15, 2020Updated 5 years ago
- An iOS third-party RAW camera app with HDR+ and histogram viewer.☆15Jun 25, 2021Updated 4 years ago
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆18Nov 14, 2023Updated 2 years ago
- GRPO Algorithm for Llava Architecture (Based on Verl)☆49May 9, 2025Updated last year
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆69Jul 1, 2025Updated 11 months ago
- 🔥🔥🔥 We release an implementation of many attention mechanism models! Plug-and-play, performance booster!☆19Aug 14, 2024Updated last year
- ☆123Jan 18, 2026Updated 4 months ago
- Building a VLM model starts from the basic module.☆18Apr 7, 2024Updated 2 years ago
- This repository lists some awesome public Open World object detection series projects.☆31Feb 22, 2024Updated 2 years ago
- [AAAI 2024] Official pytorch implementation of “Learning Real-World Image De-Weathering with Imperfect Supervision”☆17Aug 22, 2024Updated last year
- (CVPR2025) the code of "Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection"☆71Jun 23, 2025Updated 11 months ago
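- Course materials from the UCAS Yanqi Lake campus for 2024~2025, including reinforcement learning, intelligent computing systems, pattern recognition, matrix analysis and applications, principles and algorithms of artificial intelligence, and natural language processing☆48Sep 22, 2025Updated 8 months ago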
- [KDD 2026 Oral] MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios☆152Updated this week
- [CVPR 2024 Highlight] SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image☆40Jul 30, 2024Updated last year
- Resources for instructor Li Baobin's 2020 Matrix Analysis course at UCAS (slides, homework, and the major programming assignment)☆26Jun 5, 2024Updated 2 years ago
- Image processor service using Sharp, gRPC and Node.js☆16May 1, 2024Updated 2 years ago
- An implementation of GRPO for Unsloth's VLMs training☆84Aug 7, 2025Updated 10 months ago