mahkons / Lottery-ticket-hypothesisLinks
This repository contains a Pytorch implementation of the article "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks" and an application of this hypothesis to reinforcement learning
☆9Updated 4 years ago
Alternatives and similar repositories for Lottery-ticket-hypothesis
Users that are interested in Lottery-ticket-hypothesis are comparing it to the libraries listed below
Sorting:
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 7 years ago
- ☆41Updated 4 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 4 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- ☆20Updated 5 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 7 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Updated 4 years ago
- An adaptive training algorithm for residual network☆15Updated 4 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆28Updated 4 years ago
- ☆11Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- ☆17Updated last year
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆49Updated last month
- ☆50Updated 4 years ago
- The server portion of the Neural Chat project to deploy chatbots on web. This code is accompanied by another repository that includes the…☆36Updated 4 years ago
- "Towards Robust, Locally Linear Deep Networks" (ICLR 2019)☆9Updated 6 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆14Updated 5 years ago
- PhD thesis (updating) of Jiatao Gu from HKU☆19Updated 6 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆64Updated 3 weeks ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- Code for "MIM: Mutual Information Machine" paper.☆16Updated 2 years ago
- Investigate the speed of adaptation of structural causal models☆15Updated 4 years ago
- lanmt ebm☆12Updated 5 years ago
- codes for TokenManipulationGAN☆7Updated 5 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 5 years ago
- Online Hyperparameter Optimization☆10Updated 4 years ago
- DILMA: Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers☆12Updated 4 years ago
- ☆25Updated last year