alexiglad / EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆597 · Nov 12, 2025 · Updated 3 months ago
Alternatives and similar repositories for EBT
Users who are interested in EBT are comparing it to the repositories listed below.
- ☆88 · Jun 14, 2024 · Updated last year
- Official repository for the paper "Flow Equivariant Recurrent Neural Networks" ☆31 · Jul 2, 2025 · Updated 7 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆117 · Sep 22, 2024 · Updated last year
- The original Shared Recurrent Memory Transformer implementation ☆33 · Jul 11, 2025 · Updated 7 months ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting ☆70 · Jan 9, 2026 · Updated last month
- ☆53 · Oct 29, 2025 · Updated 3 months ago
- Versatile human antibody sequence design ☆21 · May 27, 2025 · Updated 8 months ago
- Work in progress. ☆79 · Nov 25, 2025 · Updated 2 months ago
- ☆17 · Aug 1, 2025 · Updated 6 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data ☆198 · Dec 18, 2025 · Updated last month
- ☆20 · Oct 22, 2025 · Updated 3 months ago
- ☆18 · Jun 19, 2025 · Updated 7 months ago
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model ☆619 · Sep 29, 2025 · Updated 4 months ago
- minimal Energy-based transformer ☆43 · Dec 11, 2025 · Updated 2 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards" ☆59 · Jan 5, 2026 · Updated last month
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons ☆109 · Oct 22, 2025 · Updated 3 months ago
- Python wrapper for lean-gym ☆12 · Apr 5, 2023 · Updated 2 years ago
- CS194-196 Course Project ☆14 · Feb 20, 2025 · Updated 11 months ago
- [ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling" ☆29 · Jun 2, 2025 · Updated 8 months ago
- Code for BLT research paper ☆2,028 · Nov 3, 2025 · Updated 3 months ago
- A pure and fast NumPy implementation of Mamba with cache support. ☆18 · Jun 16, 2024 · Updated last year
- Massively-Parallel Natural Extension of Reference Frame ☆33 · Jan 18, 2023 · Updated 3 years ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆176 · Jan 16, 2025 · Updated last year
- ☆215 · Jan 5, 2026 · Updated last month
- Pretraining and inference code for a large-scale depth-recurrent language model ☆863 · Dec 29, 2025 · Updated last month
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,187 · Jan 30, 2025 · Updated last year
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,761 · Dec 29, 2025 · Updated last month
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL ☆24 · Oct 27, 2024 · Updated last year
- Code for "SCHA-VAE: Hierarchical Context Aggregation for Few-Shot Generation" @ ICML 2022 ☆17 · Jan 10, 2023 · Updated 3 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆27 · Mar 1, 2025 · Updated 11 months ago
- UQ: Assessing Language Models on Unsolved Questions ☆30 · Aug 26, 2025 · Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆29 · Oct 9, 2025 · Updated 4 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,521 · Feb 27, 2025 · Updated 11 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆132 · Dec 3, 2024 · Updated last year
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi… ☆163 · Mar 4, 2023 · Updated 2 years ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective ☆232 · Jan 26, 2026 · Updated 2 weeks ago
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆950 · Jul 10, 2025 · Updated 7 months ago
- PoE-World: Compositional World Modeling with Products of Programmatic Experts ☆39 · Feb 5, 2026 · Updated last week
- ☆20 · Oct 3, 2024 · Updated last year