Jaykef / mlx-rag-gguf
Minimal, clean code implementation of RAG with mlx using gguf model weights
☆49Updated 10 months ago
Alternatives and similar repositories for mlx-rag-gguf:
Users that are interested in mlx-rag-gguf are comparing it to the libraries listed below
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆37Updated 2 weeks ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 8 months ago
- Run large models from the terminal using Apple MLX.☆29Updated 11 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 8 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆57Updated 10 months ago
- Implementation of nougat that focuses on processing pdf locally.☆79Updated last month
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆28Updated last month
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆73Updated last year
- All the world is a play, we are but actors in it.☆47Updated this week
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆73Updated 3 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆21Updated 8 months ago
- entropix style sampling + GUI☆25Updated 4 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 6 months ago
- For inferring and serving local LLMs using the MLX framework☆96Updated 11 months ago
- A simple script to enhance text editing across your Mac, leveraging the power of MLX. Designed for seamless integration, it offers real-t…☆103Updated last year
- huggingface chat-ui integration with mlx-lm server☆60Updated last year
- ☆65Updated 9 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆59Updated 7 months ago
- ☆38Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 6 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆75Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated last month
- Distributed Inference for mlx LLm☆84Updated 7 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- mlx image models for Apple Silicon machines☆73Updated 3 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆116Updated 2 weeks ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆51Updated last year
- ☆29Updated 3 months ago