MekkCyber / EdgeLLMLinks
Deploying large language models on edge
☆81Updated 3 months ago
Alternatives and similar repositories for EdgeLLM
Users that are interested in EdgeLLM are comparing it to the libraries listed below
Sorting:
- React Native binding of llama.cpp☆767Updated this week
- One click templates for inferencing Language Models☆223Updated last month
- Fast parallel LLM inference for MLX☆241Updated last year
- React Native binding of llama.cpp☆45Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Own your AI, search the web with it🌐😎☆94Updated 11 months ago
- ☆107Updated 2 months ago
- ☆301Updated 5 months ago
- ☆182Updated 10 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Updated 2 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 4 months ago
- From data to vector database effortlessly☆88Updated 7 months ago
- ☆134Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 4 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆82Updated 4 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆276Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆456Updated 4 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 4 months ago
- Sparse Inferencing for transformer based LLMs☆216Updated 5 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆331Updated this week
- Kyutai with an "eye"☆232Updated 9 months ago
- On-device intelligence.☆392Updated 9 months ago
- Finetune Llama-3-8b on the MathInstruct dataset☆116Updated last year
- Self-host LLMs with vLLM and BentoML☆163Updated last month
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 3 months ago
- ☆79Updated 3 months ago
- Together Open Deep Research☆356Updated 8 months ago
- Runnable examples for LiveKit Agents in Python☆227Updated this week
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- ⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀☆63Updated 3 months ago