GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For more on 3SO: https://huggingface.co/blog/ucheog/llm-power-steering
☆135 · Feb 27, 2026 · Updated 2 months ago
Alternatives and similar repositories for Toolio
Users who are interested in Toolio are comparing it to the libraries listed below.
- Shared personal notes created while working with the Apple MLX machine learning framework ☆24 · Dec 12, 2025 · Updated 4 months ago
- ☆92 · Jan 24, 2025 · Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. ☆42 · Jun 20, 2025 · Updated 10 months ago
- FastMLX is a high performance production ready API to host MLX models. ☆352 · Mar 18, 2025 · Updated last year
- mlx implementations of various transformers, speedups, training ☆33 · Dec 14, 2023 · Updated 2 years ago
- ☆21 · Oct 9, 2024 · Updated last year
- Client-side toolkit for using large language models, including where self-hosted ☆115 · Feb 2, 2026 · Updated 3 months ago
- MLX binary vectors and associated algorithms. ☆14 · Mar 13, 2025 · Updated last year
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon ☆45 · Mar 31, 2026 · Updated last month
- Implementation of nougat that focuses on processing pdf locally. ☆85 · Jan 15, 2025 · Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM. ☆38 · Sep 11, 2024 · Updated last year
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device. ☆181 · Jan 31, 2024 · Updated 2 years ago
- ☆15 · May 17, 2024 · Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆24 · Oct 30, 2024 · Updated last year
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx ☆33 · Mar 12, 2026 · Updated last month
- MLX-based QA pair generator and LLM finetuning tool in Streamlit ☆42 · Oct 18, 2025 · Updated 6 months ago
- CLI tool for text to image generation using the FLUX.1 model. ☆67 · Jun 28, 2025 · Updated 10 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX ☆59 · Feb 9, 2024 · Updated 2 years ago
- ☆44 · Jun 27, 2025 · Updated 10 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated 11 months ago
- For inferring and serving local LLMs using the MLX framework ☆114 · Mar 24, 2024 · Updated 2 years ago
- Gradio chat interface for FastMLX ☆12 · Sep 22, 2024 · Updated last year
- run embeddings in MLX ☆98 · Sep 27, 2024 · Updated last year
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ☆64 · Apr 14, 2024 · Updated 2 years ago
- MLX native implementations of state-of-the-art generative image models ☆2,037 · Apr 10, 2026 · Updated 3 weeks ago
- Swift implementation of Flux.1 using mlx-swift ☆122 · Aug 10, 2025 · Updated 8 months ago
- The easiest way to run the fastest MLX-based LLMs locally ☆323 · Oct 30, 2024 · Updated last year
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆360 · Apr 24, 2026 · Updated last week
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu… ☆26 · Aug 16, 2024 · Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆38 · Jun 21, 2024 · Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework ☆80 · Jan 28, 2024 · Updated 2 years ago
- CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A… ☆91 · Jul 25, 2024 · Updated last year
- Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs. ☆97 · Feb 5, 2024 · Updated 2 years ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆286 · Jun 16, 2025 · Updated 10 months ago
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions. ☆16 · Dec 4, 2025 · Updated 5 months ago
- On-device semantic search over Apple WWDC 2025 docs using MLX embeddings — SwiftUI app (WWDC OMT 2025) ☆76 · Jun 12, 2025 · Updated 10 months ago
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit. ☆263 · Oct 25, 2025 · Updated 6 months ago
- ☆10 · Oct 24, 2024 · Updated last year
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA. ☆225 · Apr 6, 2026 · Updated 3 weeks ago