Doriandarko / MLX-GRPOLinks
A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.
☆42Updated 6 months ago
Alternatives and similar repositories for MLX-GRPO
Users that are interested in MLX-GRPO are comparing it to the libraries listed below
Sorting:
- auto fine tune of models with synthetic data☆76Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆67Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 6 months ago
- A couple scripts to grab stats from email☆43Updated 10 months ago
- ☆77Updated 7 months ago
- RAG example using DSPy, Gradio, FastAPI☆83Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆49Updated 10 months ago
- ☆47Updated last year
- ☆29Updated 8 months ago
- The next evolution of Agents☆48Updated 2 weeks ago
- Tutorial for DSPy☆23Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- An AI Agent for Personal Self-Reflection☆59Updated 5 months ago
- ☆19Updated 6 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 10 months ago
- ☆62Updated 9 months ago
- uses all reasoning models in parallel and synthesizes an answer with o1. also has multi-chat where you can chat with any of them☆39Updated 6 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆83Updated 4 months ago
- ☆42Updated last year
- ☆17Updated last year
- Starter app for creating an AI task completion agent with gmail capabilities.☆27Updated last year
- ☆102Updated last month
- A collection of example AI programs built using DSPy and maitained by the Langtrace AI team.☆35Updated 8 months ago
- Welcome to **FluidAPI**, a framework that allows you to interact with APIs using **natural language**. No more JSON, headers, or complex …☆31Updated last month
- ☆89Updated 6 months ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆89Updated 6 months ago
- Outputs from the Deep Writer☆16Updated 10 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 10 months ago
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:…☆82Updated 9 months ago