Implement a miniLLM from zero to one (hands-on LLM learning)
☆79May 4, 2026Updated 2 months ago
Alternatives and similar repositories for LLMs-101
Users that are interested in LLMs-101 are comparing it to the libraries listed below.
- Build a MiniLLM from 0 to 1 (pretrain+sft+dpo in progress)☆554Mar 23, 2025Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆505May 1, 2025Updated last year
- Chinese version of hf-alignment-handbook: a complete set of SFT, DPO, ORPO, and CPT training tutorials for large models.☆15Aug 25, 2024Updated last year
- Android native mediacodec decode/encode demo☆14Dec 16, 2021Updated 4 years ago
- This is the code for DGEL. Thank you for your kind attention and citations.☆12Nov 27, 2024Updated last year
- Implementing RAG Knowledge Base with Langchain☆14Nov 7, 2024Updated last year
- Semantic Segmentation on the ACDC Cardiac Dataset☆11Nov 18, 2024Updated last year
- 「PyTorch」A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors …☆91Jun 12, 2022Updated 4 years ago
- Digital humans + large models☆26Nov 7, 2023Updated 2 years ago
- RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation☆45Dec 9, 2025Updated 6 months ago
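- This project builds and optimizes, from scratch, a large-scale pretrained language model at the ten-million-parameter scale, covering three stages: pretraining, supervised fine-tuning (SFT), and R1 reasoning distillation. It uses a custom Transformer architecture (including RMSNorm, grouped attention, a multi-query mechanism, SwiGLU activation, and RoPE positional encoding) to achieve efficient long-text processing and…☆22Mar 10, 2025Updated last year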
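- A repository for pretraining from scratch and SFT of a small-parameter Chinese LLaMa2; a single 24G GPU is enough to obtain a chat-llama2 with simple Chinese question-answering ability.☆2,926May 21, 2024Updated 2 years ago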
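- Based on an e-commerce shopping-guide bot: natural language understanding (NLU), text error correction, and disambiguation of ambiguous words☆12May 5, 2020Updated 6 years ago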
- A solver for linear complementarity problems☆12Dec 16, 2021Updated 4 years ago
- PinData is a modern, open-source dataset management platform designed specifically for large language model (LLM) training workflows☆43Jul 7, 2025Updated 11 months ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated 2 years ago
- Guangdong University of Technology Java course project: a multi-user instant chat program with a graphical interface☆11May 25, 2022Updated 4 years ago
- Solves pagination problems when converting HTML to PDF☆20Jul 27, 2022Updated 3 years ago
- Here is the repo for public scripts.☆12Jul 16, 2022Updated 3 years ago
- Python package to process videos as in Hu and Ma (2024)☆21Sep 29, 2024Updated last year
- Animated Grid Ionic App inspired by a Codrops tutorial☆11Apr 24, 2016Updated 10 years ago
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models"☆27Jun 24, 2024Updated 2 years ago
- Implement a small-parameter Chinese large language model from scratch.☆1,052Aug 22, 2024Updated last year
- A fun Android game to train your brain with some quick math quizzes.☆12May 30, 2019Updated 7 years ago
- Vary-tiny codebase built upon LAVIS (for training from scratch) and PDF image-text pair data (about 600k, including English/Chinese)☆89Sep 21, 2024Updated last year
- A moveit package of mycobot☆12Sep 23, 2022Updated 3 years ago
- nanoGPT using Equinox☆15Mar 3, 2023Updated 3 years ago
- ☆17Apr 16, 2021Updated 5 years ago
- The MongoDB Database☆22Dec 7, 2016Updated 9 years ago
- Fetch and insert AI-generated summaries of web content. Combine with Send To Kindle for quick summaries and full articles. Support for Mi…☆23May 19, 2026Updated last month
- ☆13Jan 23, 2025Updated last year
- ☆13Jan 10, 2023Updated 3 years ago
- Android course project: a WeChat clone with a simple UI, not connected to a backend; a WeChat for the elderly☆10Jan 4, 2020Updated 6 years ago
- Connect and dynamically manage multiple MCP servers/tools through a single SSE interface, allowing your AI agent or AI APP to control MCP…☆17May 22, 2025Updated last year
- I recently attended the Geekbang "Large Language Models Application Development Practice Camp", where I learned about the application de…☆45Aug 23, 2024Updated last year
- cracked prompt of famous coding agent and autodev☆24Mar 19, 2026Updated 3 months ago
- Blockchain, Bitcoin, digital currency, cryptographic algorithms☆10Apr 1, 2018Updated 8 years ago
- Recalled questions from the 2018 "912" exam☆11Dec 24, 2018Updated 7 years ago
- Tsinghua University computer science minor data structures assignments (spring semester 2015)☆12Jun 21, 2015Updated 11 years ago