azreasoners / recurrent_transformer

☆8

Alternatives and similar repositories for recurrent_transformer:

Users that are interested in recurrent_transformer are comparing it to the libraries listed below

albertqjiang / INT
Official code for paper: INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
☆39Updated 2 years ago
AllanYangZhou / universal_neural_functional
☆49Updated last year
aw31 / empirical-ntks
Efficient empirical NTKs in PyTorch
☆18Updated 2 years ago
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆36Updated 2 years ago
GFNOrg / GFN_vs_HVI
☆9Updated 2 years ago
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…
☆106Updated last year
google-deepmind / exedec
☆12Updated last year
Sea-Snell / grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆78Updated 2 years ago
IdoAmos / not-from-scratch
☆31Updated 6 months ago
hartvigsen-group / composable-interventions
☆28Updated 2 months ago
fjzzq2002 / pizza
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆15Updated last year
KihoPark / linear_rep_geometry
☆92Updated 2 months ago
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆60Updated last year
facebookresearch / LAWT
Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)
☆68Updated 8 months ago
HEmile / a-nesi
A Scalable Approximate Method for Probabilistic Neurosymbolic Inference
☆15Updated 3 months ago
Johswald / awesome-hypernetworks
☆62Updated 3 years ago
linlu-qiu / lm-inductive-reasoning
☆34Updated last year
KSB21ST / MINI-ARC
☆33Updated last year
wesg52 / universal-neurons
Universal Neurons in GPT2 Language Models
☆28Updated 11 months ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆61Updated last year
khalil-research / ARGA-AAAI23
Abstract Reasoning with Graph Abstractions (ARGA) implementation
☆61Updated 10 months ago
albertqjiang / draft_sketch_prove
☆67Updated last year
mansheej / icl-task-diversity
Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"
☆21Updated last year
GFNOrg / EB_GFN
Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"
☆82Updated 2 years ago
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆59Updated 3 years ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆58Updated last year
HKUNLP / subgoal-theorem-prover
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆19Updated last year
joshuacnf / paradox-learning2reason
☆34Updated 4 months ago
KareemYousrii / SPL
This repository holds the code for the NeurIPS 2022 paper, Semantic Probabilistic Layers
☆27Updated last year
bhoov / energy-transformer-jax
The Energy Transformer block, in JAX
☆57Updated last year