ducminhkhoi / autograd_pytorchLinks
Building your own autograd mechanism based on PyTorch tensor only (not Variable, can be seen as numpy array)
☆22Updated last year
Alternatives and similar repositories for autograd_pytorch
Users that are interested in autograd_pytorch are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆17Updated 4 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Updated 4 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 4 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆46Updated 5 years ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆50Updated 2 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Nonparametric Score Estimators, ICML 2020☆36Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- ☆29Updated 2 years ago
- Neuronal Circuit Policies☆40Updated 3 years ago
- solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning☆23Updated 2 years ago
- Google Research☆45Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 5 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 7 years ago
- An adaptive training algorithm for residual network☆17Updated 5 years ago
- Understanding RL vision Distill article☆24Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆31Updated 2 years ago
- EMNLP 2020: On the Ability and Limitations of Transformers to Recognize Formal Languages☆24Updated 4 years ago
- A supplementary code for Editable Neural Networks, an ICLR 2020 submission.☆46Updated 5 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 5 years ago
- Investigate the speed of adaptation of structural causal models☆15Updated 4 years ago
- A Learnable LSH Framework for Efficient NN Training☆32Updated 4 years ago
- Fork of diux-dev/imagenet18☆16Updated 6 years ago