Keen-Technologies / physical_atariLinks
Platform for evaluating reinforcement learning (RL) algorithms on a physical Atari system.
☆145Updated 5 months ago
Alternatives and similar repositories for physical_atari
Users that are interested in physical_atari are comparing it to the libraries listed below
Sorting:
- seqax = sequence modeling + JAX☆170Updated 6 months ago
- Dion optimizer algorithm☆431Updated 3 weeks ago
- ☆251Updated last year
- A simple, performant and scalable JAX-based world modeling codebase.☆129Updated 3 weeks ago
- ☆62Updated 7 months ago
- ☆180Updated 2 months ago
- Cost aware hyperparameter tuning algorithm☆179Updated last year
- ☆291Updated last year
- SIMD quantization kernels☆94Updated 5 months ago
- Simple Transformer in Jax☆142Updated last year
- ☆215Updated last month
- ☆136Updated 2 months ago
- The history files when recording human interaction while solving ARC tasks☆117Updated 2 weeks ago
- Quantized LLM training in pure CUDA/C++.☆238Updated 3 weeks ago
- Implementation of Diffusion Transformer (DiT) in JAX☆306Updated last year
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆671Updated 5 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆406Updated 7 months ago
- MoE training for Me and You and maybe other people☆353Updated this week
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆115Updated last month
- Solve puzzles. Learn CUDA.☆63Updated 2 years ago
- Solve puzzles to improve your tinygrad skills!☆178Updated 3 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated 2 months ago
- Code for the Fractured Entangled Representation Hypothesis position paper!☆221Updated 3 months ago
- Async RL Training at Scale☆1,044Updated this week
- An implementation of delta-iris in tinygrad☆72Updated last year
- ☆544Updated 6 months ago
- A 3D video game environment and benchmark designed from scratch for reinforcement learning research☆190Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆371Updated 7 months ago
- Tutorials on tinygrad☆456Updated 4 months ago
- commaVQ is a dataset of compressed driving video☆350Updated last week