GusLovesMath / Llama3_MacSiliconLinks
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework, with install guide & perf tips. Aims to optimize LLM performance on Mac silicon for devs & researchers.
☆11Updated last year
Alternatives and similar repositories for Llama3_MacSilicon
Users that are interested in Llama3_MacSilicon are comparing it to the libraries listed below
Sorting:
- ☆11Updated 2 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16Updated 5 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆18Updated 11 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated last week
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- BH hackathon☆13Updated last year
- Retrieval-augmented generation (RAG) for remote & local LLM use☆45Updated 4 months ago
- ☆54Updated last week
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated last week
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 7 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆52Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 6 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- ☆47Updated last year
- ☆17Updated last year
- An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate h…☆21Updated last year
- Query, ask and chat with a document-index via transformer models!☆17Updated 2 years ago
- Scripts to create your own moe models using mlx☆90Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Updated 4 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆44Updated 8 months ago
- ☆21Updated 11 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆36Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆13Updated 2 weeks ago
- Web Interface for Vision Language Models Including InternVLM2☆23Updated last year
- unsloth-5090-multiple☆52Updated 5 months ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆15Updated 11 months ago
- LLM reads a paper and produce a working prototype☆57Updated 6 months ago
- ☆20Updated last year
- Simple LLM inference server☆20Updated last year