Doriandarko / MLX-GRPOLinks
A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.
☆44Updated 8 months ago
Alternatives and similar repositories for MLX-GRPO
Users that are interested in MLX-GRPO are comparing it to the libraries listed below
Sorting:
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆70Updated last year
- A couple scripts to grab stats from email☆43Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated last year
- auto fine tune of models with synthetic data☆75Updated last year
- ☆47Updated last year
- ☆78Updated 10 months ago
- ☆42Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 9 months ago
- A Python package to dynamically load functions for OpenAI Assistant☆54Updated last year
- The next evolution of Agents☆47Updated last week
- Starter app for creating an AI task completion agent with gmail capabilities.☆27Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 10 months ago
- ☆30Updated 10 months ago
- Simple Graph Memory for AI applications☆89Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆88Updated last month
- Outputs from the Deep Writer☆16Updated last year
- ☆19Updated 9 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- ☆17Updated last year
- A framework for hosting and scaling AI agents.☆38Updated 11 months ago
- Tools to simplify life with AI☆27Updated 6 months ago
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆11Updated 11 months ago
- RAG example using DSPy, Gradio, FastAPI☆85Updated last year
- ☆89Updated 9 months ago
- ☆104Updated 4 months ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆42Updated 2 months ago
- An AI Agent for Personal Self-Reflection☆60Updated 8 months ago
- Personal project, Generative AI, Streamlit, Python☆54Updated 5 months ago
- ☆35Updated 2 months ago