ivanbelenky / RLLinks
R.L. methods and techniques.
☆196Updated 7 months ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm☆177Updated 4 months ago
- Lamport's Bakery Algorithm Demonstrated in Python☆96Updated last year
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆252Updated last year
- Automated, smooth, N'th order derivatives of non-uniformly sampled time series data☆226Updated 8 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆203Updated 10 months ago
- ☆248Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU☆191Updated 9 months ago
- A BERT that you can train on a (gaming) laptop.☆209Updated last year
- This is a numpy implementation of the Skip-gram algorithm described in Mikolov et al's Word2Vec paper. It is intended for didactic purpos…☆36Updated 2 years ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆111Updated 3 weeks ago
- time to learn mlx☆40Updated last month
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆631Updated 3 weeks ago
- Docker-based inference engine for AMD GPUs☆231Updated 9 months ago
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆97Updated last year
- Grandmaster-Level Chess Without Search☆584Updated 6 months ago
- Optimally allocate poker chips using constrained, nonlinear optimization☆174Updated 6 months ago
- Rewriting Principia Mathematica in Lean☆132Updated 7 months ago
- Sequential Logic☆110Updated last week
- This is a python implementation for stitching images.☆232Updated 9 months ago
- Grow virtual creatures in static and physics simulated environments.☆53Updated last year
- High-Performance Klong array language in Python.☆303Updated last week
- Algebraic enhancements for GEMM & AI accelerators☆277Updated 4 months ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆237Updated 2 years ago
- ☆123Updated last month
- A tiny autograd engine with a Jax-like API☆61Updated this week
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- ☆195Updated 2 months ago
- Tensor library & inference framework for machine learning☆99Updated 2 weeks ago
- Dead Simple LLM Abliteration☆220Updated 4 months ago
- ☆183Updated 6 months ago