Implementing a miniLLM from zero to one (hands-on LLM learning)
☆79 · May 4, 2026 · Updated last month
Alternatives and similar repositories for LLMs-101
Users that are interested in LLMs-101 are comparing it to the libraries listed below.
- Building a MiniLLM from 0 to 1 (pretrain+sft+dpo, practice in progress) ☆552 · Mar 23, 2025 · Updated last year
- A repository for individuals to experiment with and reproduce the LLM pre-training process. ☆503 · May 1, 2025 · Updated last year
- Chinese version of hf-alignment-handbook: a complete set of SFT, DPO, ORPO, and CPT training tutorials for large models. ☆15 · Aug 25, 2024 · Updated last year
- Implementing RAG Knowledge Base with Langchain ☆14 · Nov 7, 2024 · Updated last year
- The IELTS vocabulary book 雅思词汇真经, IELTS grammar, Listening 179, Reading 538 synonym substitutions, and more. Everything from preparing for my IELTS exam. ☆18 · Feb 21, 2024 · Updated 2 years ago
- My computational narrative notebooks. ☆10 · Aug 13, 2018 · Updated 7 years ago
- 「PyTorch」A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors … ☆91 · Jun 12, 2022 · Updated 4 years ago
- Digital humans + large models ☆26 · Nov 7, 2023 · Updated 2 years ago
- ☆22 · Dec 25, 2019 · Updated 6 years ago
- A repository for pre-training from scratch and SFT of a small-parameter Chinese LLaMa2; a single 24G GPU is enough to obtain a chat-llama2 with simple Chinese question-answering ability. ☆2,920 · May 21, 2024 · Updated 2 years ago
- Based on an e-commerce shopping-guide bot: natural language understanding (NLU), text error correction, and ambiguous-word disambiguation ☆12 · May 5, 2020 · Updated 6 years ago
- A solver for linear complementarity problems ☆12 · Dec 16, 2021 · Updated 4 years ago
- PinData is a modern, open-source dataset management platform designed specifically for large language model (LLM) training workflows ☆43 · Jul 7, 2025 · Updated 11 months ago
- ANN-based Expectations Algorithm applied to the Neoclassical Investment Model ☆10 · Mar 15, 2023 · Updated 3 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated last year
- Here is the repo for public scripts. ☆12 · Jul 16, 2022 · Updated 3 years ago
- Materials saved from a livestream about DDPM ☆20 · Apr 7, 2024 · Updated 2 years ago
- Replication material for "Optimal Automatic Stabilizers" ☆11 · Aug 9, 2021 · Updated 4 years ago
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models" ☆27 · Jun 24, 2024 · Updated last year
- Starting from zero, run through the technical roadmap of chatgpt. ☆281 · Sep 5, 2024 · Updated last year
- Implementing a small-parameter Chinese large language model from scratch. ☆1,043 · Aug 22, 2024 · Updated last year
- A fun Android game to train your brain with some quick math quizzes. ☆12 · May 30, 2019 · Updated 7 years ago
- This repository contains the code to generate results from the paper "Artificial Neural Networks to solve dynamic programming problems: a… ☆10 · May 24, 2024 · Updated 2 years ago
- ☆13 · Jan 17, 2018 · Updated 8 years ago
- Vary-tiny codebase built on LAVIS (for training from scratch) and a PDF image-text pair dataset (about 600k, including English/Chinese) ☆88 · Sep 21, 2024 · Updated last year
- Server side of a ride-hailing app ☆13 · Nov 15, 2015 · Updated 10 years ago
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b… ☆13 · Nov 22, 2021 · Updated 4 years ago
- ☆28 · Jan 17, 2026 · Updated 4 months ago
- The MongoDB Database ☆22 · Dec 7, 2016 · Updated 9 years ago
- ☆12 · Aug 6, 2024 · Updated last year
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths ☆19 · Jul 10, 2025 · Updated 11 months ago
- Fetch and insert AI-generated summaries of web content. Combine with Send To Kindle for quick summaries and full articles. Support for Mi… ☆23 · May 19, 2026 · Updated 3 weeks ago
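- Blockchain, Bitcoin, digital currency, encryption algorithms ☆10 · Apr 1, 2018 · Updated 8 years ago
- cracked prompt of famous coding agent and autodev ☆22 · Mar 19, 2026 · Updated 2 months ago
- Recalled real exam questions for 912 from 2018 ☆11 · Dec 24, 2018 · Updated 7 years ago
- ☆10 · Jan 25, 2018 · Updated 8 years ago
- A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling ☆15 · Dec 5, 2023 · Updated 2 years ago
- Replication files for numerical solution in "Monetary Policy, Redistribution, and Risk Premia" ☆13 · Jan 23, 2024 · Updated 2 years ago
- A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, RLHF optimization, and more. Supports SFT fine-tuning for downstream tasks, with an example of fine-tuning for triple information extraction. ☆1,711 · Apr 20, 2024 · Updated 2 years ago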