GusLovesMath / Llama3_MacSiliconLinks
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework, with install guide & perf tips. Aims to optimize LLM performance on Mac silicon for devs & researchers.
☆12Updated last year
Alternatives and similar repositories for Llama3_MacSilicon
Users that are interested in Llama3_MacSilicon are comparing it to the libraries listed below
Sorting:
- ☆54Updated this week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 5 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- ☆47Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated this week
- ☆21Updated 10 months ago
- BH hackathon☆14Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆95Updated last year
- unsloth-5090-multiple☆50Updated 4 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆28Updated 3 months ago
- run ollama & gguf easily with a single command☆52Updated last year
- GGUF Quantization of any LLM.☆40Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated this week
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Updated last year
- ☆11Updated 2 years ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 4 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆17Updated 4 months ago
- Retrieval-augmented generation (RAG) for remote & local LLM use☆45Updated 4 months ago
- ☆101Updated 3 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆18Updated last year
- ☆31Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Updated 10 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆36Updated last year
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆14Updated 3 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆20Updated 6 months ago
- ☆45Updated 4 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 7 months ago
- A forest of autonomous agents.☆19Updated 8 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year