alexiglad/EBT

alexiglad / EBT

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

☆640

Alternatives and similar repositories for EBT

Users who are interested in EBT are comparing it to the libraries listed below.

yilundu / ired_code_release
☆95 · Jun 14, 2024 · Updated 2 years ago
sdan / nanoEBM
minimal Energy-based transformer
☆44 · Dec 11, 2025 · Updated 7 months ago
SakanaAI / continuous-thought-machines
Continuous Thought Machines, because thought takes time and reasoning is a process.
☆2,004 · Dec 29, 2025 · Updated 7 months ago
UbiquantAI / URM
Universal Reasoning Model
☆134 · Jan 15, 2026 · Updated 6 months ago
soran-ghaderi / torchebm
🍓 Simulation-free, GPU-first generative modeling in PyTorch ⚡ Composable primitives for scalable, stable training of modern EBMs, diffus…
☆106 · Jul 21, 2026 · Updated last week
raywang4 / EqM
☆209 · Oct 9, 2025 · Updated 9 months ago
alexOarga / compositional_reasoning
[NeurIPS'25] Generalizable Reasoning through Compositional Energy Minimization
☆29 · Oct 28, 2025 · Updated 9 months ago
facebookresearch / vjepa2
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,402 · Mar 23, 2026 · Updated 4 months ago
louaaron / Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
☆739 · Feb 29, 2024 · Updated 2 years ago
sapientinc / HRM
Hierarchical Reasoning Model Official Release
☆12,599 · Mar 31, 2026 · Updated 3 months ago
kuleshov-group / bd3lms
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,025 · Jul 10, 2025 · Updated last year
amorehead / jvp_flash_attention
Flash Attention Triton kernel with support for second-order derivatives
☆180 · May 14, 2026 · Updated 2 months ago
Lemon-cmd / energy-transformer-graph
This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classificatio…
☆27 · Jan 28, 2024 · Updated 2 years ago
test-time-training / e2e
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆627 · Feb 15, 2026 · Updated 5 months ago
ZHZisZZ / dllm
dLLM: Simple Diffusion Language Modeling
☆2,658 · Jul 17, 2026 · Updated last week
buoyancy99 / diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
☆1,280 · Jul 6, 2026 · Updated 3 weeks ago
goombalab / hnet
H-Net: Hierarchical Network with Dynamic Chunking
☆869 · Nov 20, 2025 · Updated 8 months ago
ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
☆3,917 · Jul 15, 2026 · Updated 2 weeks ago
m1balcerak / EnergyMatching
[NeurIPS 2025] Official repository for "Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling"
☆226 · Jul 13, 2026 · Updated 2 weeks ago
galilai-group / lejepa
☆1,293 · Jan 25, 2026 · Updated 6 months ago
yilundu / irem_code_release
ICML 2022: Learning Iterative Reasoning through Energy Minimization
☆48 · Feb 27, 2023 · Updated 3 years ago
facebookresearch / blt
Code for BLT research paper
☆2,053 · Nov 3, 2025 · Updated 8 months ago
facebookresearch / jepa-intuitive-physics
This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos"
☆265 · Jun 3, 2026 · Updated last month
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆170 · Aug 25, 2025 · Updated 11 months ago
RobertRosenbaum / Torch2PC
Software for using predictive coding algorithms to train PyTorch models.
☆29 · Feb 13, 2024 · Updated 2 years ago
cambrian-mllm / cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
☆563 · Apr 3, 2026 · Updated 3 months ago
yifanzhang-pro / deep-delta-learning
Official Project Page for Deep Delta Learning (https://arxiv.org/abs/2601.00417)
☆356 · Updated this week
facebookresearch / cwm
Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.
☆883 · Jul 17, 2026 · Updated last week
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆703 · Sep 29, 2025 · Updated 10 months ago
lucidrains / PoPE-pytorch
Efficient implementation (and explorations) into polar coordinate positional embedding (PoPE) - from Gopalakrishnan et al. under Schmidhu…
☆71 · Jun 21, 2026 · Updated last month
lucidrains / titans-pytorch
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch
☆1,970 · Jul 13, 2026 · Updated 2 weeks ago
facebookresearch / eb_jepa
An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and e…
☆748 · Jul 17, 2026 · Updated last week
raymin0223 / mixture_of_recursions
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)
☆579 · Sep 26, 2025 · Updated 10 months ago
s-sahoo / Eso-LMs
[ICML 2026] Esoteric Language Models
☆122 · Jul 13, 2026 · Updated 2 weeks ago
olsdavis / semicat
Official implementation of Categorical Flow Maps on text.
☆67 · Feb 16, 2026 · Updated 5 months ago
SamsungSAILMontreal / TinyRecursiveModels
☆6,572 · Apr 1, 2026 · Updated 3 months ago
ESHyperscale / HyperscaleES
Jax Codebase for Evolutionary Strategies at the Hyperscale
☆351 · Feb 27, 2026 · Updated 5 months ago
Gen-Verse / MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660 · Feb 14, 2026 · Updated 5 months ago
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆30 · Aug 26, 2025 · Updated 11 months ago