Clean RL implementation using MLX
☆34Mar 8, 2024Updated 2 years ago
Alternatives and similar repositories for clean-rl-mlx
Users that are interested in clean-rl-mlx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆38Jun 21, 2024Updated last year
- A reinforcement learning framework based on MLX.☆254Dec 1, 2025Updated 4 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 6 months ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- Chat with MLX is a high-performance macOS application that connects your local documents to a personalized large language model (LLM).☆178Mar 8, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- mlx image models for Apple Silicon machines☆95Apr 8, 2026Updated last week
- ☆38Mar 12, 2024Updated 2 years ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆20Nov 11, 2024Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Apr 8, 2026Updated last week
- Test server code for Phi-2 model. support OpenAI API spec☆18Dec 15, 2023Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- A collection of optimizers for MLX☆57Dec 12, 2025Updated 4 months ago
- run embeddings in MLX☆98Sep 27, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Development and evaluation of different approaches for fibre tracking of diffusion weighted MRI data.☆10May 9, 2022Updated 3 years ago
- moodist☆27Apr 3, 2026Updated 2 weeks ago
- A simple github actions script to build a llamafile and uploads to huggingface☆17Jan 11, 2024Updated 2 years ago
- Simple repository for training small reasoning models☆50Feb 17, 2026Updated 2 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆23Dec 18, 2023Updated 2 years ago
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Sep 12, 2023Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- Implementation of the ICLR 2022 paper "Phase Collapse in Neural Networks."☆10Mar 21, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Simple GUI to load a PDF/Docx/txt file and have LM Studio Answer based off of it.☆14Jul 31, 2024Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆284Jun 16, 2025Updated 10 months ago
- Abstraction and Reasoning Corpus☆14Nov 22, 2022Updated 3 years ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions☆25Aug 8, 2024Updated last year
- Repository for score-based transport modeling.☆11Jul 22, 2023Updated 2 years ago
- For MHC-I protein-peptide binding predictions: Deep Learning model with CNN and Snakemake workflow☆13Oct 22, 2018Updated 7 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24☆12Apr 8, 2024Updated 2 years ago
- LLVM Version Manager☆11Apr 21, 2017Updated 8 years ago
- Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)☆17Mar 6, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Apr 18, 2024Updated 2 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆18May 28, 2025Updated 10 months ago
- nyc is so back☆21Jun 27, 2025Updated 9 months ago
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max).☆25Dec 16, 2023Updated 2 years ago
- ☆18Sep 7, 2023Updated 2 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- run deepseek v3 on a single node. Drops unused experts from memory.☆16Jan 26, 2025Updated last year