GusLovesMath / Llama3_MacSilicon
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features a Jupyter notebook for Meta-Llama-3 setup using the MLX framework, with an install guide and performance tips. Aims to optimize LLM performance on Mac silicon for developers and researchers.
☆10 Updated last year
Alternatives and similar repositories for Llama3_MacSilicon
Users who are interested in Llama3_MacSilicon are comparing it to the libraries listed below.
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆13 Updated 3 weeks ago
- ☆11 Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple ☆20 Updated 6 months ago
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks. ☆23 Updated last year
- ☆10 Updated 11 months ago
- Tools for merging pretrained large language models. ☆19 Updated 11 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 Updated 8 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM. ☆34 Updated last year
- Integrate an LLM copilot within your Keras model development workflow ☆28 Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆23 Updated last year
- ☆29 Updated last year
- ☆11 Updated 11 months ago
- ☆21 Updated 3 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools. ☆18 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆29 Updated last year
- A forest of autonomous agents. ☆19 Updated 3 months ago
- GPT-4V(ision) module for use with Autodistill. ☆26 Updated 9 months ago
- BH hackathon ☆14 Updated last year
- ☆28 Updated last year
- Repo of the code from the Medium article ☆20 Updated 11 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆17 Updated last week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆47 Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU ☆18 Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year
- ☆21 Updated 6 months ago
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser. ☆13 Updated last week
- ☆14 Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more! ☆12 Updated this week