JuniMay / llm.rsLinks
An attempt to migrate Karpathy's llm.c to safe rust.
☆13Updated last year
Alternatives and similar repositories for llm.rs
Users that are interested in llm.rs are comparing it to the libraries listed below
Sorting:
- ☆126Updated 2 weeks ago
- LLM Inference benchmark☆433Updated last year
- 支持中文场景的的小语言模型 llama2.c-zh☆150Updated last year
- ☆175Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆39Updated 5 months ago
- 更纯粹、更高压缩率的Tokenizer☆490Updated last year
- a lightweight LLM model inference framework☆748Updated last year
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆251Updated last year
- Retriever-0.1B☆96Updated last year
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec…☆226Updated 3 weeks ago
- Wiki fo HPC☆130Updated 6 months ago
- ☆34Updated last year
- ☆523Updated 2 weeks ago
- C++ implementation of Qwen-LM☆616Updated last year
- The dataset and baseline code for ASC23 LLM inference optimization challenge.☆32Updated 2 years ago
- Accelerate inference without tears☆372Updated 2 weeks ago
- Efficient AI Inference & Serving☆479Updated 2 years ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆154Updated last month
- 笔记☆50Updated 5 months ago
- ☆183Updated 2 weeks ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆274Updated 6 months ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆120Updated last year
- LongBench v2 and LongBench (ACL 25'&24')☆1,081Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆418Updated last year
- Efficient Training (including pre-training and fine-tuning) for Big Models☆618Updated 3 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆103Updated last month
- ☆68Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆703Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆256Updated last year
- ☆234Updated last year