Laz4rz / RLLinks
☆16 Updated 8 months ago
Alternatives and similar repositories for RL
Users who are interested in RL are comparing it to the repositories listed below
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆172 Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think ☆59 Updated this week
- ☆46 Updated 6 months ago
- Fine tune Gemma 3 on an object detection task ☆86 Updated 3 months ago
- An introduction to LLM Sampling ☆79 Updated 10 months ago
- ☆68 Updated 4 months ago
- Compiling useful links, papers, benchmarks, ideas, etc. ☆45 Updated 7 months ago
- lossily compress representation vectors using product quantization ☆59 Updated 5 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated last month
- look how they massacred my boy ☆63 Updated last year
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ☆56 Updated 5 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆68 Updated 4 months ago
- A repository containing general tutorials I'd like to share with the world. ☆46 Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 Updated last year
- Simple Transformer in Jax ☆139 Updated last year
- Low memory full parameter finetuning of LLMs ☆53 Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 Updated 11 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated 2 years ago
- NanoGPT-speedrunning for the poor T4 enjoyers ☆72 Updated 5 months ago
- SIMD quantization kernels ☆87 Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆195 Updated last year
- Quick Notebook Tutorials ☆36 Updated 3 months ago
- Training-Ready RL Environments + Evals ☆128 Updated this week
- Notebooks for fine tuning pali gemma ☆117 Updated 6 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆275 Updated 10 months ago
- ☆45 Updated 4 months ago
- ☆86 Updated last year
- ☆75 Updated last year
- smolLM with Entropix sampler on pytorch ☆150 Updated 11 months ago
- rl from zero pretrain, can it be done? yes. ☆275 Updated 3 weeks ago