Keen-Technologies / physical_atariLinks
Platform for evaluating reinforcement learning (RL) algorithms on a physical Atari system.
☆136Updated 3 months ago
Alternatives and similar repositories for physical_atari
Users that are interested in physical_atari are comparing it to the libraries listed below
Sorting:
- seqax = sequence modeling + JAX☆168Updated 4 months ago
- ☆285Updated last year
- Cost aware hyperparameter tuning algorithm☆176Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆357Updated 5 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆657Updated 3 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆71Updated 11 months ago
- ☆128Updated 3 weeks ago
- ☆201Updated 3 months ago
- A simple, performant and scalable JAX-based world modeling codebase.☆116Updated last month
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆26Updated 9 months ago
- The NetHack Learning Environment☆96Updated last week
- ☆176Updated last week
- ☆56Updated 5 months ago
- A reinforcement learning codebase focusing on the emergence of cooperation and alignment in multi-agent AI systems.☆134Updated this week
- ☆108Updated last week
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated last year
- Training API and CLI☆253Updated last week
- Dion optimizer algorithm☆403Updated last week
- Efficient baselines for autocurricula in JAX.☆201Updated last year
- A 3D video game environment and benchmark designed from scratch for reinforcement learning research☆190Updated 2 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆106Updated 2 weeks ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆62Updated 2 years ago
- Async RL Training at Scale☆938Updated this week
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆405Updated 5 months ago
- ☆532Updated 4 months ago
- Simple Transformer in Jax☆140Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 weeks ago
- SIMD quantization kernels☆93Updated 3 months ago
- A fast and robust algorithm for temporal difference learning☆21Updated last week
- Benchmarking Agentic LLM and VLM Reasoning On Games☆214Updated last week