JuniMay / llm.rsLinks
An attempt to migrate Karpathy's llm.c to safe rust.
☆13Updated last year
Alternatives and similar repositories for llm.rs
Users that are interested in llm.rs are comparing it to the libraries listed below
Sorting:
- ☆126Updated last week
- 支持中文场景的的小语言模型 llama2.c-zh☆150Updated last year
- 笔记☆44Updated 2 months ago
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing☆60Updated 9 months ago
- LLM Inference benchmark☆427Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆39Updated last month
- Fairy±i (iFairy): Complex-valued Quantization Framework for Large Language Models☆104Updated last week
- Triton Documentation in Chinese Simplified / Triton 中文文档☆86Updated 6 months ago
- ☆147Updated 3 months ago
- ☆164Updated last year
- easy cuda code☆84Updated 9 months ago
- Wiki fo HPC☆121Updated 2 months ago
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec…☆198Updated last week
- 尝试自己从头写一个LLM,参考llama和nanogpt☆66Updated last year
- ☆13Updated last year
- USTC Principles and Techniques of Compiler 2023 homepage☆28Updated 11 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆285Updated 9 months ago
- MoFA - Modular Framework for Agents. Modular, Compositional and Programmable.☆117Updated 2 weeks ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆114Updated this week
- A PyTorch-like deep learning framework. Just for fun.☆156Updated 2 years ago
- ☆63Updated 11 months ago
- LLM101n: Let's build a Storyteller 中文版☆133Updated last year
- ☆259Updated last week
- Retriever-0.1B☆95Updated last year
- 更纯粹、更高压缩率的Tokenizer☆485Updated 10 months ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆19Updated 9 months ago
- ☆23Updated 6 months ago
- Puzzles for learning Triton, play it with minimal environment configuration!☆545Updated 3 weeks ago
- 《解构大语言模型:从线性回归到通用人工智能》配套代码☆246Updated 3 months ago
- Efficient inference of large language models.☆149Updated 3 weeks ago