GusLovesMath / Local_LLM_Training_Apple_SiliconLinks
Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tuned the Llama3 model on 16 GPUs for streamlined solution of verbose math word problems. Result: a powerful, privacy-preserving chatbot that runs smoothly on-device.
β20Updated last year
Alternatives and similar repositories for Local_LLM_Training_Apple_Silicon
Users that are interested in Local_LLM_Training_Apple_Silicon are comparing it to the libraries listed below
Sorting:
- β28Updated 9 months ago
- Own your AI, search the web with itππβ87Updated 4 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.β40Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Modified to use local Ollama endpointβ50Updated 7 months ago
- A few examples of how Dendrite's SDK can be used to automate web processes and build AI agents.β37Updated 7 months ago
- Gradio chat interface for FastMLXβ12Updated 8 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Siliconβ17Updated last month
- LangChain + LiteLLM that worksβ44Updated 2 weeks ago
- Run CrewAI agent workflows on local LLM models with Llamafile and Ollamaβ40Updated last year
- huggingface chat-ui integration with mlx-lm serverβ60Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq APIβ¦β31Updated 8 months ago
- π§ Retrieval Augmented Generation (RAG) exampleβ16Updated 11 months ago
- β23Updated last year
- π€ Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAGβ32Updated 6 months ago
- Local first human friendly agents toolkit for the browser and Nodejsβ39Updated last week
- Your Python AI Coder!β34Updated 2 weeks ago
- A simple streamlit app that performs Retrieval-Augmented Generation over a corpus of presidential speechesβ17Updated last year
- Agentic RAG to help you build a startupπβ44Updated 2 months ago
- Access the Cohere Command R family of modelsβ37Updated 2 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weightsβ50Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOSβ36Updated 11 months ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceβ91Updated 11 months ago
- LocalPlexity is a lite version of Perplexity aimed at 100% privacy and openness. Everything is done locally, in your browser, from searchβ¦β15Updated 9 months ago
- β21Updated 7 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx frameworkβ74Updated last year
- β14Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale β¦β19Updated last week
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.β14Updated last year
- Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.β94Updated last year
- MLX implementation of xLSTM model by Beck et al. (2024)β27Updated last year