noahfarr / rlx
A reinforcement learning framework based on MLX.
☆248 Updated 2 months ago
Alternatives and similar repositories for rlx
Users who are interested in rlx are comparing it to the libraries listed below.
- Port of Andrej Karpathy's nanoGPT to the Apple MLX framework. ☆117 Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆137 Updated last week
- Run PaliGemma in real time ☆133 Updated last year
- General multi-task deep RL agent ☆185 Updated last year
- History files recorded from human interaction while solving ARC tasks ☆117 Updated 2 weeks ago
- Fast parallel LLM inference for MLX ☆246 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 5 months ago
- GPT-2 from scratch in MLX ☆414 Updated last year
- ☆134 Updated last year
- Efficient framework-agnostic data loading ☆459 Updated 4 months ago
- ☆112 Updated 2 years ago
- Cost-aware hyperparameter tuning algorithm ☆179 Updated last year
- Gradient descent is cool and all, but what if we could delete it? ☆106 Updated 5 months ago
- Simple Transformer in JAX ☆142 Updated last year
- Start a server from the MLX library. ☆198 Updated last year
- SmolLM with Entropix sampler in PyTorch ☆149 Updated last year
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting" ☆111 Updated last year
- ☆125 Updated last year
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism. ☆115 Updated last month
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA. ☆215 Updated last month
- Plotting (entropy, varentropy) for small LMs ☆99 Updated 8 months ago
- Fast bare-bones BPE for modern tokenizer training ☆175 Updated 7 months ago
- ☆62 Updated 7 months ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device. ☆178 Updated 2 years ago
- ☆67 Updated 6 months ago
- An MLX project to train a base model on your WhatsApp chats using (Q)LoRA fine-tuning ☆173 Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆232 Updated last year
- Port of Andrej Karpathy's llm.c to Mojo ☆363 Updated 6 months ago
- A puzzle to learn about prompting ☆135 Updated 2 years ago
- This repository contains a simple Llama 3 implementation in pure JAX. ☆71 Updated 11 months ago