GusLovesMath / Local_LLM_Training_Apple_SiliconLinks

Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tuned the Llama3 model on 16 GPUs for streamlined solution of verbose math word problems. Result: a powerful, privacy-preserving chatbot that runs smoothly on-device.

☆20

Alternatives and similar repositories for Local_LLM_Training_Apple_Silicon

Users that are interested in Local_LLM_Training_Apple_Silicon are comparing it to the libraries listed below

Sorting:

rapidarchitect / ollama-crew-mesop
☆28Updated 9 months ago
AstraBert / PrAIvateSearch
Own your AI, search the web with it🌐😎
☆87Updated 4 months ago
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆40Updated this week
victorb / ollama-swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Modified to use local Ollama endpoint
☆50Updated 7 months ago
dendrite-systems / dendrite-examples
A few examples of how Dendrite's SDK can be used to automate web processes and build AI agents.
☆37Updated 7 months ago
ndurner / mlx_chat
Gradio chat interface for FastMLX
☆12Updated 8 months ago
armbues / SiLLM-examples
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
☆17Updated last month
stanford-oval / chainlite
LangChain + LiteLLM that works
☆44Updated 2 weeks ago
heaversm / crew-llamafile
Run CrewAI agent workflows on local LLM models with Llamafile and Ollama
☆40Updated last year
mzbac / mlx-chat-ui
huggingface chat-ui integration with mlx-lm server
☆60Updated last year
unclecode / whisperanywhere-js
WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…
☆31Updated 8 months ago
AbeEstrada / mlx-rag
🧠 Retrieval Augmented Generation (RAG) example
☆16Updated 11 months ago
InsightEdge01 / ScrapegraphAIOllamallama3
☆23Updated last year
enso-labs / llm-server
🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG
☆32Updated 6 months ago
synw / agent-smith
Local first human friendly agents toolkit for the browser and Nodejs
☆39Updated last week
truemagic-coder / nemo-agent
Your Python AI Coder!
☆34Updated 2 weeks ago
definitive-io / presidential-speeches-rag
A simple streamlit app that performs Retrieval-Augmented Generation over a corpus of presidential speeches
☆17Updated last year
AstraBert / ragcoon
Agentic RAG to help you build a startup🚀
☆44Updated 2 months ago
simonw / llm-command-r
Access the Cohere Command R family of models
☆37Updated 2 months ago
Jaykef / mlx-rag-gguf
Minimal, clean code implementation of RAG with mlx using gguf model weights
☆50Updated last year
Attunewise / GPT
OpenAI GPT hosted Agent Framework for Windows and MacOS
☆36Updated 11 months ago
Aesthisia / LLMinator
Gradio based tool to run opensource LLM models directly from Huggingface
☆91Updated 11 months ago
legraphista / localplexity
LocalPlexity is a lite version of Perplexity aimed at 100% privacy and openness. Everything is done locally, in your browser, from search…
☆15Updated 9 months ago
gradio-app / sambanova-gradio
☆21Updated 7 months ago
Peter-obi / Video_summarization_mlx
Transcribe and summarize videos using whisper and llms on apple mlx framework
☆74Updated last year
MikeBirdTech / open-interpreter-python-templates
☆14Updated last year
The-Swarm-Corporation / OmniParse
Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …
☆19Updated last week
AIAnytime / On-device-real-time-RAG-App
On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.
☆14Updated last year
apeatling / simple-guide-to-mlx-finetuning
Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.
☆94Updated last year
abeleinin / mlx-xLSTM
MLX implementation of xLSTM model by Beck et al. (2024)
☆27Updated last year