ivanleomk / modal-grpo
☆13Updated last week
Alternatives and similar repositories for modal-grpo:
Users that are interested in modal-grpo are comparing it to the libraries listed below
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- Apps that run on modal.com☆12Updated 9 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 6 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Verbosity control for AI agents☆60Updated 10 months ago
- a version of baby agi using dspy and typed predictors☆17Updated last year
- alternative way to calculating self attention☆18Updated 10 months ago
- ☆38Updated 7 months ago
- BH hackathon☆14Updated 11 months ago
- ☆31Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 2 months ago
- Example code using the DSPy framework.☆18Updated 9 months ago
- Chat Markup Language conversation library☆55Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆29Updated last month
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated last month
- Very minimal (and stateless) agent framework☆41Updated 2 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 9 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Simple GRPO scripts and configurations.☆58Updated last month
- ☆1Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Because it's there.☆15Updated 6 months ago
- The world's first fully automated VC fund.☆20Updated last week
- ☆26Updated last year
- LLM reads a paper and produce a working prototype☆51Updated last week
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated 3 weeks ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 6 months ago