Doriandarko / MLX-GRPO
A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.
☆43Updated 7 months ago
Alternatives and similar repositories for MLX-GRPO
Users who are interested in MLX-GRPO are comparing it to the libraries listed below.
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆69Updated last year
- ☆47Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated 11 months ago
- ☆78Updated 8 months ago
- auto fine tune of models with synthetic data☆76Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- A couple scripts to grab stats from email☆43Updated last year
- ☆42Updated last year
- ☆17Updated last year
- Personal project, Generative AI, Streamlit, Python☆54Updated 4 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆39Updated last year
- Simple Graph Memory for AI applications☆90Updated 3 months ago
- Starter app for creating an AI task completion agent with gmail capabilities.☆27Updated last year
- An AI Agent for Personal Self-Reflection☆59Updated 7 months ago
- ☆31Updated last month
- The next evolution of Agents☆47Updated last week
- A simple script to enhance text editing across your Mac, leveraging the power of MLX. Designed for seamless integration, it offers real-t…☆107Updated last year
- A Python package to dynamically load functions for OpenAI Assistant☆54Updated last year
- ☆104Updated 3 months ago
- RAG example using DSPy, Gradio, FastAPI☆84Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 11 months ago
- ☆89Updated 8 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆85Updated this week
- ☆30Updated 9 months ago
- Welcome to FluidAPI, it's a framework that allows you to interact with APIs using natural language. No more JSON, headers, or complex for…☆31Updated last week
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…☆23Updated last month
- Dynamic Metadata based RAG Framework☆75Updated last year
- A collection of example AI programs built using DSPy and maintained by the Langtrace AI team.☆41Updated 9 months ago