NexaAI / Awesome-LLMs-on-deviceLinks
Awesome LLMs on Device: A Comprehensive Survey
β1,231Updated 9 months ago
Alternatives and similar repositories for Awesome-LLMs-on-device
Users that are interested in Awesome-LLMs-on-device are comparing it to the libraries listed below
Sorting:
- Unified KV Cache Compression Methods for Auto-Regressive Modelsβ1,262Updated 9 months ago
- [ICLR 2025π₯] SVD-LLM & [NAACL 2025π₯] SVD-LLM V2β257Updated last month
- Fast Multimodal LLM on Mobile Devicesβ1,118Updated last week
- A highly optimized LLM inference acceleration engine for Llama and its variants.β902Updated 3 months ago
- Train your Agent model via our easy and efficient frameworkβ1,571Updated last week
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasksβ3,199Updated last week
- Align Anything: Training All-modality Model with Feedbackβ4,570Updated 2 months ago
- β897Updated this week
- An acceleration library that supports arbitrary bit-width combinatorial quantization operationsβ236Updated last year
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challengesβ1,907Updated 2 weeks ago
- Recipes to train reward model for RLHF.β1,470Updated 6 months ago
- adds Sequence Parallelism into LLaMA-Factoryβ578Updated last week
- Large Language Model (LLM) Systems Paper Listβ1,563Updated last week
- A curated list for Efficient Large Language Modelsβ1,874Updated 4 months ago
- minimal-cost for training 0.5B R1-Zeroβ778Updated 5 months ago
- The official implementation of Self-Play Preference Optimization (SPPO)β582Updated 9 months ago
- [TMLR 2024] Efficient Large Language Models: A Surveyβ1,221Updated 4 months ago
- Build multimodal language agents for fast prototype and productionβ2,565Updated 7 months ago
- Survey Paper List - Efficient LLM and Foundation Modelsβ258Updated last year
- βCurie: Automated and Rigorous Scientific Experimentation with AI Agentsβ294Updated 3 weeks ago
- Awesome LLM compression research papers and tools.β1,690Updated 3 months ago
- Awesome Mobile LLMsβ256Updated last week
- Fast inference from large lauguage models via speculative decodingβ841Updated last year
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilitiesγβ1,828Updated 9 months ago
- Uni-MoE: Lychee's Large Multimodal Model Family.β793Updated this week
- Easiest and laziest way for building multi-agent LLMs applications.β2,996Updated last week
- Low-bit LLM inference on CPU/NPU with lookup tableβ876Updated 4 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Headsβ2,646Updated last year
- [COLMβ25] DeepRetrieval β π₯ The First Search Agent Trained by On-Policy Reinforcement Learningβ654Updated 2 weeks ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"β551Updated 2 months ago