andrew-silva/clean-rl-mlx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/andrew-silva/clean-rl-mlx)

andrew-silva / clean-rl-mlx

Clean RL implementation using MLX

☆34

Alternatives and similar repositories for clean-rl-mlx

Users that are interested in clean-rl-mlx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

noahfarr / rlx
View on GitHub
A reinforcement learning framework based on MLX.
☆260Jul 1, 2026Updated 3 weeks ago
armbues / SiLLM-examples
View on GitHub
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
☆16May 8, 2025Updated last year
mzbac / mlx-llm-server
View on GitHub
For inferring and serving local LLMs using the MLX framework
☆115Mar 24, 2024Updated 2 years ago
The-Swarm-Corporation / AgentParse
View on GitHub
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18Oct 13, 2025Updated 9 months ago
mlx-chat / mlx-chat-app
View on GitHub
Chat with MLX is a high-performance macOS application that connects your local documents to a personalized large language model (LLM).
☆178Mar 8, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
riccardomusmeci / mlx-image
View on GitHub
mlx image models for Apple Silicon machines
☆100Apr 8, 2026Updated 3 months ago
Doriandarko / mlx-local-server
View on GitHub
A tiny server to run local inference on MLX model in the style of OpenAI
☆13Jan 31, 2024Updated 2 years ago
ziozzang / Mac_mlx_phi-2_server
View on GitHub
Test server code for Phi-2 model. support OpenAI API spec
☆18Dec 15, 2023Updated 2 years ago
Agora-Lab-AI / Atom
View on GitHub
a suite of finetuned LLMs for atomically precise function calling 🧪
☆16Updated this week
facebookresearch / gen_dgrl
View on GitHub
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29Apr 8, 2026Updated 3 months ago
stockeh / mlx-optimizers
View on GitHub
A collection of optimizers for MLX
☆57Dec 12, 2025Updated 7 months ago
XinJingHao / Actor-Sharer-Learner
View on GitHub
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆22Dec 29, 2024Updated last year
nih23 / deepFibreTracking
View on GitHub
Development and evaluation of different approaches for fibre tracking of diffusion weighted MRI data.
☆10May 9, 2022Updated 4 years ago
arc-community / arc-generative-DSL-infinite-data
View on GitHub
slowly building a set of infinite riddle generators for data-hungry methods
☆14Nov 15, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
rabilrbl / llamafile-builder
View on GitHub
A simple github actions script to build a llamafile and uploads to huggingface
☆17Jan 11, 2024Updated 2 years ago
lobehub / lobe-flow
View on GitHub
🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor
☆22Dec 18, 2023Updated 2 years ago
galdl / rl_delay_basic
View on GitHub
Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.
☆14Sep 12, 2023Updated 2 years ago
tyler-romero / microR1
View on GitHub
Simple repository for training small reasoning models
☆51Feb 17, 2026Updated 5 months ago
bigcode-project / bigcodebench-annotation
View on GitHub
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
☆26Aug 8, 2024Updated last year
nf-neuro / modules
View on GitHub
nf-neuro is a nextflow neuroimaging library maintained by the SCIL team
☆21Jun 26, 2026Updated last month
armbues / SiLLM
View on GitHub
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
☆283Jun 16, 2025Updated last year
apeatling / simple-guide-to-mlx-finetuning
View on GitHub
Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.
☆97Feb 5, 2024Updated 2 years ago
BruceGeLi / TCE_RL
View on GitHub
Temporally Correlated Episodic Reinforcement Learning, ICLR 24
☆12Apr 8, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
young-geng / mlxu
View on GitHub
Machine Learning eXperiment Utilities
☆48Jul 29, 2025Updated last year
amazon-science / fast-rl-with-slow-updates
View on GitHub
☆18Sep 7, 2023Updated 2 years ago
nmboffi / sbtm
View on GitHub
Repository for score-based transport modeling.
☆11Jul 22, 2023Updated 3 years ago
stockeh / mlx-grokking
View on GitHub
Grokking on modular arithmetic in less than 150 epochs in MLX
☆15Oct 24, 2024Updated last year
DAMO-NLP-SG / Multipurpose-Chatbot
View on GitHub
A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)
☆20Apr 18, 2024Updated 2 years ago
j-webtek / Local-LLM_FineTune
View on GitHub
Finetune Your Local LLM
☆18Sep 23, 2023Updated 2 years ago
xeophon / beam
View on GitHub
☆16Feb 22, 2026Updated 5 months ago
sdothum / dotfiles
View on GitHub
☆13Updated this week
mlx-graphs / mlx-graphs
View on GitHub
Graph Neural Network library made for Apple Silicon
☆222Mar 2, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
qpwo / dsv3-lowmem
View on GitHub
run deepseek v3 on a single node. Drops unused experts from memory.
☆16Jan 26, 2025Updated last year
AShar97 / ML-RL-in-Finance
View on GitHub
Machine Learning and Reinforcement Learning in Finance Specialization (MOOC) Assignments
☆12Nov 4, 2021Updated 4 years ago
kiharalab / RL-MLZerD
View on GitHub
☆13May 26, 2022Updated 4 years ago
DDD71 / code71
View on GitHub
Algorithm implementation, including ML, RL, OR.
☆12Feb 22, 2022Updated 4 years ago
ArijanJ / midi-converter
View on GitHub
A simple tool that converts MIDI files to QWERTY sheets for playing on your favorite VP platform.
☆11Mar 3, 2026Updated 4 months ago
YuchenJin / llm.c
View on GitHub
LLM training in simple, raw C/CUDA
☆15Dec 5, 2024Updated last year
gevtushenko / block_matrix_format_performance
View on GitHub
☆12Jan 19, 2020Updated 6 years ago