gkswamy98 / dotfiles
some of my configs
☆9Updated 3 years ago
Alternatives and similar repositories for dotfiles:
Users that are interested in dotfiles are comparing it to the libraries listed below
- Machine Learning for Alignment Bootcamp☆72Updated 2 years ago
- (Model-written) LLM evals library☆18Updated 4 months ago
- Nethack Learning Environment Wrapper for Language Interface☆37Updated last year
- The NetHack Learning Environment☆68Updated 2 weeks ago
- seqax = sequence modeling + JAX☆154Updated 2 weeks ago
- ☆9Updated last year
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆34Updated 5 months ago
- ☆19Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆80Updated last year
- Train very large language models in Jax.☆204Updated last year
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- ☆22Updated 3 weeks ago
- Python library for easily making web Apps to compare humans and AI☆25Updated this week
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated 10 months ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆131Updated 8 months ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆14Updated last month
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023☆18Updated 5 months ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆181Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆120Updated 2 years ago
- ☆18Updated 2 months ago
- Scaling scaling laws with board games.☆48Updated last year
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- ☆75Updated 5 months ago
- A simple library for scaling up JAX programs☆134Updated 5 months ago
- PAIRED in PyTorch 🔥☆59Updated 2 years ago
- ☆89Updated last month
- An Open-Ended Agentic Simulator☆47Updated 8 months ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆104Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago