sdan / nanoEBMLinks
minimal Energy-based transformer
☆40Updated 3 weeks ago
Alternatives and similar repositories for nanoEBM
Users that are interested in nanoEBM are comparing it to the libraries listed below
Sorting:
- ☆122Updated last week
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- H-Net Dynamic Hierarchical Architecture☆80Updated 2 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆28Updated last year
- A simple, performant and scalable JAX-based world modeling codebase☆113Updated last month
- ☆28Updated last year
- 📄Small Batch Size Training for Language Models☆64Updated last month
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 9 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- RLP: Reinforcement as a Pretraining Objective☆201Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated 2 years ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆144Updated 6 months ago
- Jax like function transformation engine but micro, microjax☆33Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆114Updated last year
- ☆119Updated 5 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆106Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆112Updated last month
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆35Updated 4 months ago
- ☆77Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- ☆45Updated 6 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 10 months ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆14Updated last year
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University☆104Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 8 months ago
- ☆34Updated last year
- ☆157Updated 3 months ago
- ☆46Updated 8 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆85Updated 2 months ago