FutureComputing4AI / HrrformerLinks

Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)

☆58

Alternatives and similar repositories for Hrrformer

Users that are interested in Hrrformer are comparing it to the libraries listed below

Sorting:

ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
jysohn1108 / Looped-Transformer
Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…
☆27Updated 2 years ago
kazuki-irie / kv-memory-brain
Official Code Repository for the paper "Key-value memory in the brain"
☆27Updated 5 months ago
machine-discovery / deer
Parallelizing non-linear sequential models over the sequence length
☆53Updated last month
abhishekpanigrahi1996 / transformer_in_transformer
☆45Updated last year
fjzzq2002 / pizza
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆17Updated last year
bhoov / energy-transformer-jax
The Energy Transformer block, in JAX
☆59Updated last year
IDSIA / rtrl-elstm
Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)
☆12Updated last month
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆20Updated last year
justinlovelace / Diffusion-Guided-LM
☆27Updated last year
samblouir / birdie
☆13Updated 2 months ago
dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Updated last year
sustcsonglin / gated_linear_attention_layer
☆32Updated last year
sustcsonglin / mamba-triton
☆49Updated last year
ejmichaud / grokking-squared
☆26Updated 2 years ago
msakarvadia / AttentionLens
Interpretating the latent space representations of attention head outputs for LLMs
☆34Updated 11 months ago
HazyResearch / prefix-linear-attention
☆55Updated last year
jopetty / word-problem
Experiments on the impact of depth in transformers and SSMs.
☆32Updated 9 months ago
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Updated last year
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆21Updated last year
gregorbachmann / Next-Token-Failures
☆89Updated last year
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
AndPotap / einsum-search
☆32Updated 10 months ago
berlino / seq_icl
☆53Updated last year
kdu4108 / semiring-backprop-exps
☆16Updated 2 years ago
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆23Updated 3 months ago
ekinakyurek / google-research
Google Research
☆46Updated 2 years ago
HEmile / a-nesi
A Scalable Approximate Method for Probabilistic Neurosymbolic Inference
☆15Updated 6 months ago
OpenNLPLab / HGRN
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆66Updated last year