GusLovesMath / Llama3_MacSiliconLinks
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework, with install guide & perf tips. Aims to optimize LLM performance on Mac silicon for devs & researchers.
☆12Updated last year
Alternatives and similar repositories for Llama3_MacSilicon
Users that are interested in Llama3_MacSilicon are comparing it to the libraries listed below
Sorting:
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆14Updated 2 weeks ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Updated 9 months ago
- BH hackathon☆14Updated last year
- ☆21Updated 9 months ago
- ☆47Updated last year
- A forest of autonomous agents.☆19Updated 7 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆13Updated last week
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆17Updated 3 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆18Updated last year
- ☆16Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated 3 weeks ago
- GGUF Quantization of any LLM.☆40Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 4 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated 10 months ago
- ☆11Updated 2 years ago
- ☆54Updated last week
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 3 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated 2 years ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated last year
- Simple LLM inference server☆20Updated last year
- ☆31Updated last year
- ☆52Updated last week
- Gradio based tool to run opensource LLM models directly from Huggingface☆95Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Updated 2 months ago
- Modified Beam Search with periodical restart☆12Updated 11 months ago
- Web Interface for Vision Language Models Including InternVLM2☆23Updated last year
- Your Python AI Coder!☆35Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆34Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 8 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year