geohot / dumbrl
Can RL solve simple problems?
☆54 Updated last year
Alternatives and similar repositories for dumbrl
Users that are interested in dumbrl are comparing it to the libraries listed below
- An implementation of delta-iris in tinygrad ☆72 Updated 11 months ago
- Scripts and environment for the tinybox ☆94 Updated last year
- Cost aware hyperparameter tuning algorithm ☆166 Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated last year
- Efficient baselines for autocurricula in JAX. ☆191 Updated 11 months ago
- ☆42 Updated 3 weeks ago
- Noob Lessons from Stream about how GPUs work ☆124 Updated 3 months ago
- commaVQ is a dataset of compressed driving video ☆322 Updated this week
- parallelized hyperdimensional tictactoe ☆118 Updated 11 months ago
- The Tensor (or Array) ☆441 Updated 11 months ago
- ☆138 Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL ☆130 Updated last year
- ☆27 Updated last year
- ☆87 Updated last week
- Because it's there. ☆16 Updated 10 months ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge) ☆46 Updated last year
- A really tiny autograd engine ☆95 Updated 2 months ago
- This repository contains a simple llama3 implementation in pure JAX. ☆68 Updated 5 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 6 months ago
- Exploration into the Firefly algorithm in Pytorch ☆40 Updated 5 months ago
- ☆136 Updated 9 months ago
- tiny code to access tenstorrent blackhole ☆57 Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆286 Updated last year
- comma body does a loop around the office ☆26 Updated last year
- fast + parallel AlphaZero in JAX ☆97 Updated 7 months ago
- Fast bare-bones BPE for modern tokenizer training ☆164 Updated last month
- Alex Krizhevsky's original code from Google Code ☆195 Updated 9 years ago
- Distributed RL framework for solving the SoulsGym environments ☆33 Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆118 Updated last week
- Accelerated minigrid environments with JAX ☆139 Updated 2 weeks ago