ericyuegu / hal
Training AI for Super Smash Bros. Melee
☆26Updated last month
Alternatives and similar repositories for hal:
Users that are interested in hal are comparing it to the libraries listed below
- ☆27Updated 10 months ago
- ☆38Updated 9 months ago
- Focused on fast experimentation and simplicity☆72Updated 4 months ago
- ☆53Updated last year
- ☆37Updated last month
- Jax like function transformation engine but micro, microjax☆31Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆64Updated 2 weeks ago
- Lego for GRPO☆27Updated last month
- Simple repository for training small reasoning models☆27Updated 3 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆82Updated last month
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆24Updated 3 months ago
- Code for the paper "Function-Space Learning Rates"☆19Updated 3 weeks ago
- ☆21Updated 4 months ago
- look how they massacred my boy☆63Updated 6 months ago
- ☆22Updated 6 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆45Updated 3 months ago
- ☆19Updated last month
- Gymnasium environment for Pokemon Red☆36Updated 11 months ago
- Exploration into the Firefly algorithm in Pytorch☆38Updated 2 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆68Updated 2 weeks ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated this week
- σ-GPT: A New Approach to Autoregressive Models☆64Updated 8 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆24Updated 3 months ago
- Latent Large Language Models☆18Updated 8 months ago
- ☆94Updated 3 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated this week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 10 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- ☆81Updated last year