The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We develop a method for analyzing emerging functional modularity in neural networks based on differentiable weight masks and use it to point out important issues in current-day neural networks.
☆46Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for modules
Users that are interested in modules are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Oct 13, 2021Updated 4 years ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆43Feb 12, 2025Updated last year
- ☆35Mar 13, 2021Updated 5 years ago
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆23Jan 21, 2021Updated 5 years ago
- ☆10Jun 12, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Offical Repo for Splitting Steepest Descent for Growing Neural Architectures☆13May 12, 2021Updated 5 years ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).☆16Oct 14, 2021Updated 4 years ago
- ☆14Apr 8, 2021Updated 5 years ago
- Learning perturbation sets for robust machine learning☆64Aug 23, 2021Updated 4 years ago
- This is the dataset generation code for ADEPT (Approximate Derenderer, Extended Physics, and Tracking). http://physadept.csail.mit.edu/☆15Sep 26, 2022Updated 3 years ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆68Aug 15, 2025Updated 9 months ago
- "Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021☆17Jun 1, 2021Updated 4 years ago
- ☆34Apr 19, 2024Updated 2 years ago
- [NeurIPS 2020 Oral] Is normalization indispensable for training deep neural networks?☆34Jun 22, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ShapeGuard is a small tool to help with handling shapes in Tensorflow.☆17Sep 17, 2019Updated 6 years ago
- Open-source strong baseline for domain generlization re-ID. We will udpate the strong baseline and CFD method~☆10Nov 30, 2021Updated 4 years ago
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"☆174May 4, 2024Updated 2 years ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆23Jun 13, 2025Updated 11 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- A neural-symbolic joint reasoning approach for Natural Language Inference (NLI). Modeling NLI as inference path planning through a search…☆16Jun 9, 2021Updated 4 years ago
- Code to reproduce the results for Compositional Attention☆59Nov 16, 2022Updated 3 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Aug 20, 2019Updated 6 years ago
- Code for the "Binding via Reconstruction Clustering" paper☆21Jan 19, 2016Updated 10 years ago
- Elegant and fast Material Design template for academics. Perfect 100/100 performance score.☆12Mar 21, 2025Updated last year
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆71Jun 5, 2020Updated 5 years ago
- ☆68Mar 4, 2020Updated 6 years ago
- experiments for Course 'Advanced Operation System'.☆13Mar 19, 2019Updated 7 years ago
- GenMOS - A System for Generalized (off-the-shelf) 3D Multi-Object Search | ICRA 2023☆22May 17, 2023Updated 3 years ago
- Code for the ICML 2019 paper 'Conditioning by adaptive sampling for robust design'☆37Jun 4, 2021Updated 4 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Android releases of Clubhouse App☆14Apr 9, 2021Updated 5 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- Reduced-order modelling using an atlas of charts☆28Oct 18, 2022Updated 3 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Jun 28, 2019Updated 6 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆22Jun 28, 2024Updated last year
- Teaching a Convolutional Neural Network to recognize painting genre. Handcrafted dataset. Cool visualizations.☆10Dec 19, 2018Updated 7 years ago
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated 2 years ago