haoliuhl / tux
Tools and Utils for Experiments (TUX)
☆16Updated 2 months ago
Alternatives and similar repositories for tux:
Users that are interested in tux are comparing it to the libraries listed below
- GPT implementation in Flax☆18Updated 3 years ago
- Minimal but scalable implementation of large language models in JAX☆34Updated 5 months ago
- Building blocks for productive research☆52Updated 2 months ago
- ☆30Updated 4 months ago
- PyTorch Package For Quasimetric Learning☆41Updated 5 months ago
- Fast and reliable distributed systems in Python☆25Updated 2 months ago
- High quality implementations of imitation and inverse reinforcement learning algorithms☆14Updated 3 weeks ago
- Reinforcement Learning inside a 3D soccer simulation☆25Updated 7 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆13Updated 9 months ago
- General Modules for JAX☆64Updated 2 weeks ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated 9 months ago
- flexible meta-learning in jax☆12Updated last year
- Learning Robust Dynamics Through Variational Sparse Gating☆21Updated 2 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated 9 months ago
- Dreamer on JAX☆16Updated 3 years ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆27Updated 5 months ago
- JAX implementation of the Mistral 7b v0.1 model☆13Updated last year
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆21Updated 11 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆28Updated 10 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16Updated last year
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆15Updated 2 years ago
- ☆20Updated 10 months ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated 2 years ago
- POPGym Library in JAX☆11Updated last year
- Learn online intrinsic rewards from LLM feedback☆35Updated 4 months ago
- ☆15Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆53Updated last year