gkswamy98 / dotfiles
some of my configs
☆9Updated 3 years ago
Alternatives and similar repositories for dotfiles:
Users that are interested in dotfiles are comparing it to the libraries listed below
- Nethack Learning Environment Wrapper for Language Interface☆35Updated last year
- Code for magnetic mirror descent.☆15Updated last year
- ☆9Updated 11 months ago
- Scaling scaling laws with board games.☆46Updated last year
- An Open-Ended Agentic Simulator☆36Updated 5 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆230Updated last week
- Minimal but scalable implementation of large language models in JAX☆28Updated 2 months ago
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023☆14Updated 2 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆18Updated 7 months ago
- Simple JAX Graphics Library.☆29Updated 2 months ago
- PAIRED in PyTorch 🔥☆57Updated last year
- Synchronized Curriculum Learning for RL Agents☆32Updated this week
- ☆46Updated 8 months ago
- ☆72Updated 2 months ago
- ☆18Updated this week
- Library to compare and evaluate reward functions☆64Updated last year
- ☆22Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆176Updated 5 months ago
- Jiminy Cricket Environment (NeurIPS 2021)☆24Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆138Updated last year
- ☆19Updated 9 months ago
- A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).☆31Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- An implementation of MuZero in JAX.☆54Updated 2 years ago
- A 3D video game environment and benchmark designed from scratch for reinforcement learning research☆181Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆43Updated 7 months ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 3 years ago
- Machine Learning for Alignment Bootcamp☆70Updated 2 years ago
- Inference code for LLaMA models in JAX☆114Updated 8 months ago