sdan / nanoEBMLinks
minimal Energy-based transformer
☆32Updated this week
Alternatives and similar repositories for nanoEBM
Users that are interested in nanoEBM are comparing it to the libraries listed below
Sorting:
- ☆117Updated last week
- A simple, performant and scalable JAX-based world modeling codebase☆77Updated this week
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- 📄Small Batch Size Training for Language Models☆63Updated 3 weeks ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆109Updated 3 weeks ago
- H-Net Dynamic Hierarchical Architecture☆80Updated last month
- ☆28Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated 2 years ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 8 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆113Updated last year
- ☆34Updated last year
- ☆120Updated 4 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- RLP: Reinforcement as a Pretraining Objective☆195Updated 3 weeks ago
- A basic pure pytorch implementation of flash attention☆16Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆68Updated last year
- ☆56Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 months ago
- Code for the paper "Function-Space Learning Rates"☆23Updated 4 months ago
- Jax like function transformation engine but micro, microjax☆33Updated last year
- ☆13Updated 8 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆141Updated 6 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆53Updated 2 weeks ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆28Updated last year
- ☆14Updated last year
- Simple repository for training small reasoning models☆44Updated 8 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆56Updated 7 months ago
- ☆34Updated 11 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆116Updated last week
- ☆86Updated last year