andrew-silva / mlx-rlhfLinks

An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.

☆32

Alternatives and similar repositories for mlx-rlhf

Users that are interested in mlx-rlhf are comparing it to the libraries listed below

Sorting:

xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆111Updated last year
teknium1 / transformers-gptq-quant
☆47Updated last year
QuixiAI / kraken
☆66Updated last year
ai8hyf / OpenResearchAssistant
An automated tool for discovering insights from research papaer corpora
☆138Updated last year
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆42Updated last month
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
Jaykef / mlx-rag-gguf
Minimal, clean code implementation of RAG with mlx using gguf model weights
☆52Updated last year
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated 8 months ago
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆49Updated 8 months ago
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
devadigapratham / CoDSPy
An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…
☆20Updated 6 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆82Updated 3 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆71Updated 5 months ago
interstellarninja / MeeseeksAI
A framework for orchestrating AI agents using a mermaid graph
☆77Updated last year
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆101Updated 4 months ago
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16Updated last year
Alignment-Lab-AI / KnowledgeBase
never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…
☆37Updated last year
Peter-obi / Video_summarization_mlx
Transcribe and summarize videos using whisper and llms on apple mlx framework
☆75Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
brendanhogan / picoDeepResearch
☆64Updated 2 months ago
taylorai / mlx_embedding_models
run embeddings in MLX
☆90Updated 10 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year