MekkCyber / EdgeLLMLinks

Deploying large language models on edge

☆62

Alternatives and similar repositories for EdgeLLM

Users that are interested in EdgeLLM are comparing it to the libraries listed below

Sorting:

N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆81Updated 2 months ago
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆242Updated 2 weeks ago
QuixiAI / kraken
☆66Updated last year
dottxt-ai / demos
☆106Updated 3 months ago
huggingface / large-scale-image-deduplication
☆112Updated 2 weeks ago
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆176Updated 9 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆100Updated 6 months ago
AK391 / dailypapersHN
☆86Updated 9 months ago
togethercomputer / finetuning
Finetune Llama-3-8b on the MathInstruct dataset
☆110Updated 9 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated last month
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
AstraBert / PrAIvateSearch
Own your AI, search the web with it🌐😎
☆86Updated 6 months ago
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆42Updated 3 weeks ago
Vaibhavs10 / experiments-with-mcp
☆96Updated last month
justrach / bhumi
⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀
☆57Updated last month
itsPreto / VECTR8
Embed anything.
☆28Updated last year
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆134Updated 2 weeks ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117Updated last week
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 5 months ago
cognitivecomputations / dolphin-logger
☆101Updated last month
cartesia-ai / edge
On-device intelligence.
☆363Updated 3 months ago
BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆150Updated 5 months ago
menloresearch / ReZero
☆156Updated 3 months ago
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆72Updated this week
AstraBert / ingest-anything
From data to vector database effortlessly
☆77Updated 2 months ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆64Updated 8 months ago
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 4 months ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆71Updated 4 months ago
mzbac / mlx_sharding
Distributed Inference for mlx LLm
☆93Updated 11 months ago