PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆606Mar 1, 2026Updated this week
Alternatives and similar repositories for EBT
Users that are interested in EBT are comparing it to the libraries listed below
Sorting:
- ☆88Jun 14, 2024Updated last year
- Official repository for the paper "Flow Equivariant Recurrent Neural Networks"☆34Jul 2, 2025Updated 8 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting☆72Jan 9, 2026Updated last month
- ☆53Oct 29, 2025Updated 4 months ago
- ☆13Jul 9, 2024Updated last year
- Versatile human antibody sequence design☆22May 27, 2025Updated 9 months ago
- Work in progress.☆79Nov 25, 2025Updated 3 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆199Dec 18, 2025Updated 2 months ago
- ☆21Oct 22, 2025Updated 4 months ago
- ☆19Jun 19, 2025Updated 8 months ago
- Geometric Algebra Flow Matching (GAFL) for Protein Backbone Generation☆17Oct 31, 2025Updated 4 months ago
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model☆639Sep 29, 2025Updated 5 months ago
- minimal Energy-based transformer☆43Dec 11, 2025Updated 2 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆59Jan 5, 2026Updated 2 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆109Oct 22, 2025Updated 4 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Oct 6, 2024Updated last year
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- [ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling"☆29Jun 2, 2025Updated 9 months ago
- Code for BLT research paper☆2,029Nov 3, 2025Updated 4 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- Massively-Parallel Natural Extension of Reference Frame☆34Jan 18, 2023Updated 3 years ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆176Jan 16, 2025Updated last year
- ☆216Jan 5, 2026Updated 2 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆864Dec 29, 2025Updated 2 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,189Jan 30, 2025Updated last year
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,788Dec 29, 2025Updated 2 months ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆24Oct 27, 2024Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- ☆40Oct 2, 2025Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,566Feb 27, 2025Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆133Dec 3, 2024Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆108Nov 25, 2025Updated 3 months ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Mar 4, 2023Updated 3 years ago
- ☆13Oct 5, 2025Updated 5 months ago
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆963Jul 10, 2025Updated 7 months ago