🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!
☆57Aug 12, 2025Updated 8 months ago
Alternatives and similar repositories for clean-llm
Users that are interested in clean-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My implementation of Stanford CS336 assignments.☆239Mar 15, 2026Updated last month
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆26Jul 3, 2025Updated 10 months ago
- Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai…☆31Jun 4, 2025Updated 11 months ago
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆24Jan 24, 2026Updated 3 months ago
- ☆58Feb 9, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆96Jul 20, 2025Updated 9 months ago
- 记录我在cs336学习时的笔记和作业☆819Mar 30, 2026Updated last month
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- translate skyzh/mini-lsm to go version☆10Jun 7, 2023Updated 2 years ago
- derived from https://github.com/wilfredinni/python-cheatsheet☆10Nov 8, 2023Updated 2 years ago
- (ICCV 2025) DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup☆57Dec 13, 2025Updated 4 months ago
- ☆14Jan 12, 2026Updated 3 months ago
- 本项目是我在学习 CS336 课程过程中整理的学习笔记 This project is a collection of study notes I compiled while taking the CS336 course.☆24Nov 1, 2025Updated 6 months ago
- ☆13Jun 21, 2017Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆15Apr 23, 2026Updated last week
- ☆13Sep 25, 2021Updated 4 years ago
- Real-time speech-to-text clipboard tool with Silero VAD and local ASR support☆16Apr 29, 2026Updated last week
- basic git commands list!☆14Oct 18, 2018Updated 7 years ago
- xv6 riscv completing all the MIT optional challenges☆16Mar 5, 2024Updated 2 years ago
- hustpa ics2019☆10Jul 11, 2022Updated 3 years ago
- ☆14Mar 29, 2026Updated last month
- CMU 15-440/640 Distributed Systems☆15Oct 2, 2014Updated 11 years ago
- A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。☆36Oct 18, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Unofficial PyTorch implementation of 'Fast and High-Quality Image Denoising via Malleable Convolutions'.☆12Mar 7, 2026Updated last month
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated last month
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Nov 14, 2023Updated 2 years ago
- ☆118Jan 18, 2026Updated 3 months ago
- A simple stereo SLAM system using Seq-CALC loop detection module. May be useful and friendly for SLAM beginners.☆14Jun 10, 2020Updated 5 years ago
- GRPO Algorithm for Llava Architecture (Based on Verl)☆49May 9, 2025Updated 11 months ago
- [Submitted to NeurIPS 2024 Dataset & Benchmark Track] AFBench: A Large-scale Benchmark for Airfoil Design☆15Jul 5, 2024Updated last year
- Building a VLM model starts from the basic module.☆18Apr 7, 2024Updated 2 years ago
- [AAAI 2024] Official pytorch implementation of “Learning Real-World Image De-Weathering with Imperfect Supervision”☆17Aug 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A fast c++/cuda library of 3D render for the medical imaging☆16Jan 31, 2023Updated 3 years ago
- 华中科技大学大学CS课程其它报告存档库。组成原理、计算机网络、汇编语言、数据库、操作系统、课程设计以及大数据处理。☆10Jan 16, 2024Updated 2 years ago
- (CVPR2025) the code of "Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection"☆68Jun 23, 2025Updated 10 months ago
- MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios☆138Mar 4, 2026Updated 2 months ago
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)☆23Mar 6, 2025Updated last year
- 国科大雁栖湖校区2024~2025年课程资料,包括强化学习、智能计算系统、模式识别、矩阵分析与应用、人工智能原理与算法、自然语言处理☆41Sep 22, 2025Updated 7 months ago
- Deep Bayesian Optimization for Problems with High-Dimensional Structure☆17Sep 26, 2022Updated 3 years ago