teilomillet / retrain
A Python library that uses reinforcement learning (RL) to train LLMs.
☆42 Updated 2 months ago
Alternatives and similar repositories for retrain
Users who are interested in retrain are comparing it to the libraries listed below.
- ☆35 Updated 2 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆44 Updated 8 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆70 Updated last year
- ☆104 Updated 4 months ago
- ☆47 Updated 2 months ago
- Letting Claude Code develop his own MCP tools :) ☆123 Updated 7 months ago
- ☆80 Updated last month
- For LLMs to better code with Jina API ☆170 Updated last month
- A discovery and compression tool for your Python codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your… ☆108 Updated 10 months ago
- prompt engineering experiments with DSPy GEPA and TextGrad ☆56 Updated last month
- ☆89 Updated 9 months ago
- ☆113 Updated 3 months ago
- Conduct in-depth research with AI-driven insights: DeepDive is a command-line tool that leverages web searches and AI models to generate… ☆42 Updated last year
- ☆68 Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆88 Updated last month
- Shared Memory Storage for Multi-Agent Systems ☆126 Updated 3 months ago
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg… ☆20 Updated 5 months ago
- An OpenSource Deep Research library with reasoning ☆161 Updated last month
- A framework for hosting and scaling AI agents. ☆38 Updated 11 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS ☆36 Updated last year
- Instant Perfect Native MacOS Transcription ☆46 Updated 3 months ago
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.) ☆108 Updated last week
- ☆78 Updated 10 months ago
- auto fine tune of models with synthetic data ☆75 Updated last year
- Salesforce Enterprise Deep Research ☆147 Updated this week
- Unofficial Claude Code SDKs for Typescript and Python ☆16 Updated 5 months ago
- ☆17 Updated 10 months ago
- Thoughtful Lightning AI Assistant - Dual-engine system with DeepSeek reasoning and Groq inference, featuring Gradio UI, secure API manage… ☆20 Updated 9 months ago
- ☆19 Updated 9 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 9 months ago