ericyuegu / halLinks
Training AI for Super Smash Bros. Melee
☆30Updated 7 months ago
Alternatives and similar repositories for hal
Users that are interested in hal are comparing it to the libraries listed below
Sorting:
- ☆40Updated last year
 - ☆28Updated last year
 - H-Net Dynamic Hierarchical Architecture☆80Updated last month
 - σ-GPT: A New Approach to Autoregressive Models☆68Updated last year
 - ☆51Updated 3 months ago
 - ☆53Updated last year
 - Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated last year
 - look how they massacred my boy☆63Updated last year
 - Generative cellular automaton-like learning environments for RL.☆19Updated 9 months ago
 - Approximating the joint distribution of language models via MCTS☆22Updated last year
 - ☆24Updated 5 months ago
 - ☆197Updated 2 months ago
 - gzip Predicts Data-dependent Scaling Laws☆34Updated last year
 - ☆103Updated 3 months ago
 - Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 4 months ago
 - Latent Program Network (from the "Searching Latent Program Spaces" paper)☆102Updated last month
 - ☆34Updated last year
 - Efficient World Models with Context-Aware Tokenization. ICML 2024☆113Updated last year
 - Because it's there.☆16Updated last year
 - ☆19Updated 5 months ago
 - an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆107Updated 7 months ago
 - Code for the Fractured Entangled Representation Hypothesis position paper!☆203Updated 5 months ago
 - Simple Transformer in Jax☆139Updated last year
 - Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
 - Collection of autoregressive model implementation☆86Updated 6 months ago
 - Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated 11 months ago
 - Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆193Updated last year
 - realtime latent world model inference demo☆47Updated 11 months ago
 - ☆53Updated last year
 - DeMo: Decoupled Momentum Optimization☆195Updated 11 months ago