ericyuegu / halLinks
Training AI for Super Smash Bros. Melee
☆29Updated 4 months ago
Alternatives and similar repositories for hal
Users that are interested in hal are comparing it to the libraries listed below
Sorting:
- ☆42Updated last month
- ☆38Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆67Updated 11 months ago
- ☆27Updated last year
- H-Net Dynamic Hierarchical Architecture☆71Updated 2 weeks ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆69Updated 3 months ago
- look how they massacred my boy☆63Updated 9 months ago
- A simple, performant and scalable JAX-based world modeling codebase☆58Updated this week
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 4 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last month
- realtime latent world model inference demo☆47Updated 9 months ago
- ☆53Updated last year
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆25Updated 6 months ago
- ☆174Updated 4 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 9 months ago
- ☆23Updated 2 months ago
- Because it's there.☆16Updated 10 months ago
- Code for the paper "Function-Space Learning Rates"☆23Updated 2 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆127Updated 8 months ago
- Simple repository for training small reasoning models☆32Updated 6 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆130Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆149Updated last month
- ☆34Updated 11 months ago
- ☆136Updated 4 months ago
- Lego for GRPO☆28Updated 2 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Updated 2 months ago
- Jax like function transformation engine but micro, microjax☆33Updated 9 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆104Updated 5 months ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025)☆209Updated 2 months ago