RobertCsordas/ndr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RobertCsordas/ndr)

RobertCsordas / ndr

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

☆34

Alternatives and similar repositories for ndr

Users that are interested in ndr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ekinakyurek / lexical
View on GitHub
Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling
☆17Jan 8, 2022Updated 4 years ago
OliverRichter / normalized-attention
View on GitHub
Code publication to the paper "Normalized Attention Without Probability Cage"
☆17Nov 9, 2021Updated 4 years ago
RobertCsordas / moe_layer
View on GitHub
sigma-MoE layer
☆21Jan 5, 2024Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
ekinakyurek / compgen
View on GitHub
Paper: Learning to Recombine and Resample Data for Compositional Generalization
☆11Oct 9, 2020Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
RobertCsordas / linear_layer_as_attention
View on GitHub
The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …
☆16Jun 11, 2025Updated last year
JRC1995 / Continuous-RvNN
View on GitHub
Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)
☆12Aug 18, 2021Updated 4 years ago
GitGyun / chameleon
View on GitHub
[ECCV'24 Oral] Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
☆13Mar 13, 2025Updated last year
Zcchill / Value-Residual-Learning
View on GitHub
☆15Mar 20, 2025Updated last year
nshepperd / gumbel-rao-pytorch
View on GitHub
☆11Jul 25, 2021Updated 5 years ago
i-machine-think / machine-tasks
View on GitHub
Datasets for compositional learning
☆11Nov 28, 2018Updated 7 years ago
swiseman / neighbor-splicing
View on GitHub
☆11Jan 2, 2022Updated 4 years ago
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
jacobandreas / geca
View on GitHub
☆41Jan 11, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RobertCsordas / onion_representations
View on GitHub
☆13Aug 19, 2024Updated last year
CyndxAI / QKNorm
View on GitHub
Code for the paper "Query-Key Normalization for Transformers"
☆53Mar 6, 2021Updated 5 years ago
ashwindcruz / dgm
View on GitHub
Deep Generative Models (Chainer)
☆10Oct 12, 2017Updated 8 years ago
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago
1Konny / HVP
View on GitHub
PyTorch implementation of our paper, "Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction."
☆21Feb 10, 2021Updated 5 years ago
timvieira / vocrf
View on GitHub
Variable-order CRFs with structure learning
☆17Aug 1, 2024Updated last year
amirzandieh / HyperAttention
View on GitHub
Triton Implementation of HyperAttention Algorithm
☆48Dec 11, 2023Updated 2 years ago
srush / torch-golf
View on GitHub
Silly twitter torch implementations.
☆48Oct 14, 2022Updated 3 years ago
dguo98 / SeqMix
View on GitHub
Sequence-Level Mixed Sample Data Augmentation
☆23Mar 7, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
srush / ProbTalk
View on GitHub
☆29Nov 30, 2021Updated 4 years ago
SeongwoongCho / adversarial-autoaugment-pytorch
View on GitHub
Unofficial Pytorch Implementation Of AdversarialAutoAugment(ICLR2020)
☆21Feb 9, 2021Updated 5 years ago
diprism / fggs
View on GitHub
Factor Graph Grammars in Python
☆14Jan 17, 2026Updated 6 months ago
turboLJY / Transfer-Prompts-for-Text-Generation
View on GitHub
☆16Aug 14, 2022Updated 3 years ago
allenai / gpv2
View on GitHub
☆32Mar 7, 2022Updated 4 years ago
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
kazuki-irie / kv-memory-brain
View on GitHub
Official Code Repository for the paper "Key-value memory in the brain"
☆32Feb 25, 2025Updated last year
lucidrains / ponder-transformer
View on GitHub
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆84Oct 30, 2021Updated 4 years ago
ldmt-muri / alignment-with-openfst
View on GitHub
☆21Dec 9, 2016Updated 9 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
sustcsonglin / second-order-neural-dmv
View on GitHub
source code of COLING2020 "Second-Order Unsupervised Neural Dependency Parsing"
☆16Oct 24, 2022Updated 3 years ago
LouChao98 / nner_as_parsing
View on GitHub
☆16Mar 22, 2023Updated 3 years ago
OpenNLPLab / ETSC-Exact-Toeplitz-to-SSM-Conversion
View on GitHub
[EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…
☆14Oct 17, 2023Updated 2 years ago
longwind48 / convo-miner
View on GitHub
Mine conversations from novels in Project Gutenberg, to generate data for data-driven dialogue systems.
☆15May 7, 2019Updated 7 years ago
peterbhase / ExplanationSearch
View on GitHub
Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"
☆18Oct 17, 2022Updated 3 years ago
jungokasai / T2R
View on GitHub
☆14Nov 20, 2022Updated 3 years ago
FranxYao / Distributional-Generalization-in-Natural-Language-Processing
View on GitHub
Distributional Generalization in NLP. A roadmap.
☆86Dec 12, 2022Updated 3 years ago