LumenPallidium / energy_transformerLinks

Pytorch implementation of an energy transformer - an energy-based reccurrent variant of the transformer.

☆13

Alternatives and similar repositories for energy_transformer

Users that are interested in energy_transformer are comparing it to the libraries listed below

Sorting:

bhoov / energy-transformer-jax
The Energy Transformer block, in JAX
☆58Updated last year
enajx / HyperNCA
☆39Updated 3 years ago
Lemon-cmd / energy-transformer-graph
This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classificatio…
☆24Updated last year
wattenberg / superposition
Code associated to papers on superposition (in ML interpretability)
☆28Updated 2 years ago
ExtensityAI / benchmark
Evaluation of neuro-symbolic engines
☆35Updated 10 months ago
LumenPallidium / backprop-alts
This repository has implementations of various alternatives to backpropagation for training neural networks.
☆22Updated 5 months ago
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆59Updated 3 years ago
leonard-gleyzer / connex
Fine-grained, dynamic control of neural network topology in JAX.
☆21Updated last year
thebuckleylab / jpc
Flexible Inference for Predictive Coding Networks in JAX.
☆48Updated 3 weeks ago
google-deepmind / neural_networks_solomonoff_induction
Learning Universal Predictors
☆76Updated 10 months ago
hadivafaii / IterativeVAE
Brain-like variational inference
☆51Updated last month
EleutherAI / features-across-time
Understanding how features learned by neural networks evolve throughout training
☆35Updated 8 months ago
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37Updated last year
fjzzq2002 / pizza
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆17Updated last year
DimaKrotov / Dense_Associative_Memory
Example of Dense Associative Memory training on MNIST
☆36Updated 2 years ago
KindXiaoming / Omnigrok
Omnigrok: Grokking Beyond Algorithmic Data
☆58Updated 2 years ago
KindXiaoming / physics_of_skill_learning
We study toy models of skill learning.
☆28Updated 5 months ago
louiskirsch / vsml-neurips2021
Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905
☆32Updated 3 years ago
bilal-chughtai / rep-theory-mech-interp
☆26Updated 2 years ago
shikaiqiu / compute-better-spent
☆53Updated 8 months ago
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆87Updated 3 months ago
TrentBrick / SDMContinualLearner
☆17Updated 2 years ago
wesg52 / universal-neurons
Universal Neurons in GPT2 Language Models
☆29Updated last year
IDSIA / rtrl-elstm
Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)
☆10Updated 2 weeks ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆52Updated last year
Sea-Snell / grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆78Updated 2 years ago
apartresearch / Neuron2Graph
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
☆20Updated last year
AndPotap / einsum-search
☆32Updated 8 months ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆74Updated 7 months ago
jysohn1108 / Looped-Transformer
Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…
☆27Updated 2 years ago