ai-glimpse / toyllmLinks
ToyLLM: Learning LLM from Scratch
☆24Updated this week
Alternatives and similar repositories for toyllm
Users that are interested in toyllm are comparing it to the libraries listed below
Sorting:
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆61Updated last year
- Manages vllm-nccl dependency☆17Updated last year
- 电子鹦鹉 / Toy Language Model☆232Updated last week
- ☆78Updated 3 weeks ago
- ☆82Updated 2 years ago
- 最少使用 3090 即可训练自己的比特大脑(miniLLM)🧠(进行中). Train your own BitBrain(A mini LLM) with just an RTX 3090 minimum.☆39Updated 5 months ago
- PUA prompts for AI Agent!☆188Updated last month
- Workflow Defined Engine☆25Updated last month
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆535Updated 3 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆95Updated this week
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & …☆55Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆39Updated 3 months ago
- Tiny C++ LLM inference implementation from scratch☆95Updated 2 weeks ago
- ☆149Updated 5 months ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Updated last year
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆134Updated this week
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References☆20Updated last year
- AI Workshop Project of OceanBase 2024 Product Launch☆47Updated 2 months ago
- 模型压缩的小白入门教程☆22Updated last year
- A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.☆134Updated 2 weeks ago
- 基于Roo Cline+DeepSeek的AI开发教程☆75Updated 9 months ago
- A PyTorch-like deep learning framework. Just for fun.☆157Updated 2 years ago
- ☆125Updated 2 months ago
- Implement custom operators in PyTorch with cuda/c++☆74Updated 2 years ago
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec…☆212Updated 2 months ago
- 该系列的目的是让读者可以在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。☆376Updated 3 months ago
- 从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch☆36Updated last year
- 分层解耦的深度学习推理引擎☆78Updated 10 months ago
- The source code of the series "Pocket-OS"☆39Updated last year
- A Python Package to Access World-Class Generative Models☆131Updated last year