MekkCyber / EdgeLLM
Deploying large language models on edge
☆62 Updated 3 months ago
Alternatives and similar repositories for EdgeLLM
Users that are interested in EdgeLLM are comparing it to the libraries listed below
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆81 Updated 2 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆242 Updated 2 weeks ago
- ☆66 Updated last year
- ☆106 Updated 3 months ago
- ☆112 Updated 2 weeks ago
- Routing on Random Forest (RoRF) ☆176 Updated 9 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆100 Updated 6 months ago
- ☆86 Updated 9 months ago
- Finetune Llama-3-8b on the MathInstruct dataset ☆110 Updated 9 months ago
- Lego for GRPO ☆28 Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 5 months ago
- Own your AI, search the web with it🌐😎 ☆86 Updated 6 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. ☆42 Updated 3 weeks ago
- ☆96 Updated last month
- ⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀 ☆57 Updated last month
- Embed anything. ☆28 Updated last year
- Self-host LLMs with vLLM and BentoML ☆134 Updated 2 weeks ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆117 Updated last week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 5 months ago
- ☆101 Updated last month
- On-device intelligence. ☆363 Updated 3 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆150 Updated 5 months ago
- ☆156 Updated 3 months ago
- Fine tune Gemma 3 on an object detection task ☆72 Updated this week
- From data to vector database effortlessly ☆77 Updated 2 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆64 Updated 8 months ago
- ☆115 Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆101 Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆71 Updated 4 months ago
- Distributed Inference for mlx LLm ☆93 Updated 11 months ago