NexaAI / Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
☆1,160Updated 6 months ago
Alternatives and similar repositories for Awesome-LLMs-on-device
Users who are interested in Awesome-LLMs-on-device are comparing it to the libraries listed below
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,207Updated 6 months ago
- Train your Agent model via our easy and efficient framework☆1,284Updated last week
- A highly optimized LLM inference acceleration engine for Llama and its variants.☆900Updated 2 weeks ago
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2☆231Updated 4 months ago
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges☆1,258Updated last week
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆228Updated 9 months ago
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks☆2,773Updated this week
- Fast Multimodal LLM on Mobile Devices☆965Updated this week
- Adds Sequence Parallelism to LLaMA-Factory☆534Updated last week
- Align Anything: Training All-modality Model with Feedback☆4,286Updated last month
- A scalable, end-to-end training pipeline for general-purpose agents☆346Updated 3 weeks ago
- The official implementation of Self-Play Preference Optimization (SPPO)☆569Updated 6 months ago
- Build multimodal language agents for fast prototype and production☆2,531Updated 4 months ago
- [COLM'25] DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning☆596Updated last month
- Recipes to train reward model for RLHF.☆1,414Updated 3 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework☆245Updated this week
- TVM Documentation in Chinese Simplified / TVM 中文文档☆2,005Updated 3 months ago
- Easiest and laziest way to build multi-agent LLM applications.☆2,243Updated this week
- In-depth study of GraphRAG☆1,377Updated 3 weeks ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,578Updated last year
- ☆467Updated this week
- Minimal-cost training for 0.5B R1-Zero☆756Updated 2 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆264Updated 4 months ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,205Updated 3 months ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,192Updated last month
- The code for "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆746Updated 2 months ago
- This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).☆310Updated last month
- [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…☆516Updated this week
- Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.☆1,419Updated last week
- Survey Paper List - Efficient LLM and Foundation Models☆252Updated 10 months ago