FastTrack4LLM 是一个为大模型学习者准备的大模型学习与实践框架,帮助他们轻松掌握大模型的核心原理与训练流程,让每个人都能真正理解大模型的内部机制。本项目不仅完整复现了 LLaMA、Qwen、DeepSeek 等主流开源大模型架构,还覆盖了大模型的全生命周期:Tokenizer 训练、预训练、全量微调、参数高效微调(LoRA)、人类反馈对齐(DPO)、知识蒸馏等。不同于仅仅调用API或使用现成模型,我们带你从零开始,亲手构建、训练、优化属于自己的大语言模型,快来体验吧!
☆27Nov 6, 2025Updated 4 months ago
Alternatives and similar repositories for FastTrack4LLM
Users that are interested in FastTrack4LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NewSP: A New Search Process for Continuous Subgraph Matching over Dynamic Graphs[ICDE 24]☆15Oct 10, 2024Updated last year
- 本仓库是一份面向大模型算法工程师的面试宝典,系统梳理了大模型的核心知识点,帮助读者快速掌握大模型面试中的重点和难点☆46Sep 23, 2025Updated 6 months ago
- OpenVPN Install Script☆13Dec 31, 2022Updated 3 years ago
- 支持自动注册cursor邮箱☆10Feb 15, 2025Updated last year
- Language Models as Multi-Modal Query Planners☆17Mar 20, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Apr 2, 2024Updated last year
- About Codes for ACL 2023 paper: Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.☆20Jun 25, 2024Updated last year
- A full-stack app specifically designed to track issues.☆14May 23, 2023Updated 2 years ago
- LLM inference in C/C++☆20Oct 22, 2025Updated 5 months ago
- A virtual fidget spinner made in JavaScript.☆21Jul 3, 2018Updated 7 years ago
- ☆11Mar 6, 2023Updated 3 years ago
- Linked Stream Benchmark☆12Feb 21, 2023Updated 3 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- SmartTalk(智言)输入法是一个智能输入法项目,项目目标是通过集成先进的AI技术,将传统输入法从“工具”升级为“教练”,实现功能融合与创新,让用户秒变沟通艺术家。☆14Mar 30, 2025Updated 11 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- Developing a high-precision legal expert LLM application called Contract Advisor RAG. The project's goal is to create a Retrieval Augment…☆15Apr 10, 2024Updated last year
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- ☆20Mar 25, 2019Updated 7 years ago
- VHDL Implementation☆13Oct 9, 2014Updated 11 years ago
- 现代AI企业网站☆26Mar 22, 2025Updated last year
- Gremlin++: A C++ Interpreter for the Gremlin language.☆19Dec 26, 2024Updated last year
- (ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).☆31Sep 20, 2024Updated last year
- [CVPR2023] Practical Network Acceleration with Tiny Sets☆14Jul 28, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- NewOaks AI is an AI chatbot builder, which allows you to engage with customers, nurture leads and book conversational appointments for yo…☆21Apr 3, 2024Updated last year
- A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm.☆31Jan 8, 2023Updated 3 years ago
- Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content …☆12Jan 24, 2024Updated 2 years ago
- Source Code for "Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks"☆31Jul 9, 2024Updated last year
- Experiments codes for WSDM '24 paper "MultiFS: Automated Multi-Scenario Feature Selection in Deep Recommender Systems"☆11May 31, 2024Updated last year
- Dummy form filler for Firefox & Chrome☆20Dec 4, 2025Updated 3 months ago
- [NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning☆51Jan 20, 2026Updated 2 months ago
- ☆18Aug 17, 2014Updated 11 years ago
- Source code of "RapidFlow: An Efficient Approach to Continuous Subgraph Matching" published in VLDB'2022 - By Shixuan Sun, Xibo Sun, Bing…☆32Jun 30, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 一个适用于 macOS 的轻量级 WeChat 双开/多开脚本工具,支持 微信 4.x 及以上版本,帮助用户在 Mac 上同时登录多个微信账号。通过简单的 shell 脚本,你可以一键实现 微信双开,批量创建多个独立的 WeChat 副本,支持自动修改 Bundle ID、…☆72Sep 2, 2025Updated 6 months ago
- [ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…☆14Sep 22, 2024Updated last year
- The official implementation of paper "Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors…☆15Jun 14, 2024Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- A sophisticated web application designed to revolutionize the resume screening process by harnessing the power of multiple state-of-the-a…☆11Mar 13, 2025Updated last year
- Q-RR, DIANA-RR, Q-NASTYA, NASTYA-DIANA, QSGD, DIANA, FedCOM and FedPAQ on logistic loss with L2 regularization☆11Nov 1, 2022Updated 3 years ago
- Seo friendly Next.js 14 (SSG + ISR) store buit with Sanity CMS, Typescript, Tailwind, Shadcn/ui, GSAP and client pagination☆27Jun 3, 2025Updated 9 months ago