MekkCyber / EdgeLLMLinks
Deploying large language models on edge
β74Updated 5 months ago
Alternatives and similar repositories for EdgeLLM
Users that are interested in EdgeLLM are comparing it to the libraries listed below
Sorting:
- React Native binding of llama.cppβ39Updated this week
- Own your AI, search the web with itππβ89Updated 8 months ago
- Verifiers for LLM Reinforcement Learningβ75Updated 2 weeks ago
- β155Updated 5 months ago
- One click templates for inferencing Language Modelsβ214Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of modelsβ268Updated 2 months ago
- β35Updated 8 months ago
- Fast parallel LLM inference for MLXβ219Updated last year
- β104Updated 3 months ago
- β338Updated this week
- Luth is a state-of-the-art series of fine-tuned LLMs for Frenchβ31Updated last week
- Routing on Random Forest (RoRF)β206Updated last year
- β132Updated 5 months ago
- chrome & firefox extension to chat with webpages: local llmsβ126Updated 9 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)β306Updated 5 months ago
- Sparse Inferencing for transformer based LLMsβ198Updated last month
- The State Of The Art, intelligenceβ152Updated last month
- Developer tools to debug and build realtime voice agents. Supports multiple models.β48Updated last month
- A Lightweight Library for AI Observabilityβ251Updated 7 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ103Updated 9 months ago
- From data to vector database effortlesslyβ80Updated 4 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusionβ155Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS modelsβ451Updated last month
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.β42Updated 3 months ago
- β182Updated 7 months ago
- β263Updated 3 months ago
- β116Updated 9 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β117Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)β434Updated last month
- β207Updated last year