This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models with 100-200M parameters.
☆183 · Apr 27, 2026 · Updated last week
Alternatives and similar repositories for Mini-LLM
Users that are interested in Mini-LLM are comparing it to the libraries listed below.
- 🚀 Train a VLA yourself through the whole pipeline. 🌏 Train a 26M-parameter multimodal VLM from scratch in just 1 hour! ☆31 · Oct 16, 2025 · Updated 6 months ago
- Let coding agents use ncu skills to analyze CUDA programs automatically! ☆94 · Feb 5, 2026 · Updated 3 months ago
- Learn something after work in the evening instead of scrolling on your phone. Series 1: the CUDA computing framework CUFX (Cuda Framework eXtended). ☆16 · Dec 15, 2024 · Updated last year
- A collection of Rust interview questions ☆12 · Jan 16, 2023 · Updated 3 years ago
- Code for the paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint" ☆12 · Sep 29, 2024 · Updated last year
- Curated collection of AI inference engineering resources: LLM serving, GPU kernels, quantization, distributed inference, and production … ☆103 · Feb 4, 2026 · Updated 3 months ago
- Implementation of Direct Preference Optimization ☆17 · Jul 17, 2023 · Updated 2 years ago
- ☆45 · Nov 1, 2025 · Updated 6 months ago
- ☆12 · Aug 25, 2023 · Updated 2 years ago
- AlphaGo Zero implemented from scratch ☆17 · Nov 10, 2024 · Updated last year
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models ☆18 · May 23, 2025 · Updated 11 months ago
- Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.… ☆49 · Apr 8, 2026 · Updated last month
- Triton Compiler related materials. ☆44 · Mar 16, 2026 · Updated last month
- ☆18 · Oct 28, 2025 · Updated 6 months ago
- AC No Code is the lazy person's best way to get code accepted (AC) on an online judge (OJ): Write nothing; submit nowhere. ☆10 · May 18, 2020 · Updated 5 years ago
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment" ☆22 · Feb 10, 2025 · Updated last year
- Chinese Pharmacopoeia RAG project ☆10 · Oct 26, 2024 · Updated last year
- ☆13 · Aug 13, 2025 · Updated 8 months ago
- ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression (DAC'25) ☆27 · Feb 26, 2026 · Updated 2 months ago
- CS341 for Spring 2024 ☆11 · Jul 15, 2024 · Updated last year
- ☆49 · Apr 15, 2024 · Updated 2 years ago
- A simplified flash-attention implemented with cutlass, intended for teaching ☆59 · Aug 12, 2024 · Updated last year
- Configuring LaTeX in VS Code for efficient paper writing ☆104 · Jan 28, 2026 · Updated 3 months ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence" ☆41 · Apr 19, 2026 · Updated 2 weeks ago
- https://github.com/zyds/transformers-code ☆18 · Jan 17, 2024 · Updated 2 years ago
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning" ☆128 · Apr 7, 2026 · Updated last month
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024 ☆16 · Jul 29, 2024 · Updated last year
- Chinese Characters Visualization & Chinese Text Augmentation. ☆17 · Sep 19, 2022 · Updated 3 years ago
- The system of SUDA-HUAWEI submitted at CAMR2022. ☆12 · Nov 22, 2022 · Updated 3 years ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC" ☆41 · Nov 26, 2025 · Updated 5 months ago
- ☆23 · Jun 28, 2025 · Updated 10 months ago
- ☆13 · Apr 3, 2026 · Updated last month
- AlphaFold FAPE loss ☆10 · Sep 28, 2021 · Updated 4 years ago
- CS6868: Concurrent Programming ☆70 · Apr 20, 2026 · Updated 2 weeks ago
- Ultrafast PyTorch-like AI Framework Written from Ground-Up in Rust ☆99 · Mar 18, 2026 · Updated last month
- ☆18 · Dec 17, 2022 · Updated 3 years ago
- GEMM ☆10 · Aug 26, 2023 · Updated 2 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks ☆10 · Nov 27, 2024 · Updated last year
- ☆40 · Feb 14, 2026 · Updated 2 months ago