lyogavin / airllm
AirLLM 70B inference with single 4GB GPU
☆5,194Updated last month
Related projects ⓘ
Alternatives and complementary repositories for airllm
- Retrieval and Retrieval-augmented LLMs☆7,487Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,848Updated 6 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆5,984Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆3,944Updated this week
- Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory☆18,024Updated this week
- Tools for merging pretrained large language models.☆4,798Updated last week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆3,443Updated last month
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.