The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We develop a method for analyzing emerging functional modularity in neural networks based on differentiable weight masks and use it to point out important issues in current-day neural networks.
☆46Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for modules
Users that are interested in modules are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆43Feb 12, 2025Updated last year
- ☆35Mar 13, 2021Updated 5 years ago
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆21Jan 21, 2021Updated 5 years ago
- ☆10Jun 12, 2021Updated 4 years ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).☆16Oct 14, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Apr 8, 2021Updated 5 years ago
- Learning perturbation sets for robust machine learning☆64Aug 23, 2021Updated 4 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- CVPR'19 experiments with (on-manifold) adversarial examples.☆43Feb 27, 2020Updated 6 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020.☆13Jun 14, 2020Updated 5 years ago
- "Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021☆17Jun 1, 2021Updated 4 years ago
- ☆34Apr 19, 2024Updated last year
- Open-source strong baseline for domain generlization re-ID. We will udpate the strong baseline and CFD method~☆10Nov 30, 2021Updated 4 years ago
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- A neural-symbolic joint reasoning approach for Natural Language Inference (NLI). Modeling NLI as inference path planning through a search…☆16Jun 9, 2021Updated 4 years ago
- Code to reproduce the results for Compositional Attention☆59Nov 16, 2022Updated 3 years ago
- ☆12Oct 4, 2021Updated 4 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- ☆12Aug 10, 2023Updated 2 years ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated last month
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆38Jul 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Elegant and fast Material Design template for academics. Perfect 100/100 performance score.☆12Mar 21, 2025Updated last year
- Effective Abstract Reasoning with Dual-Contrast Network☆11Jun 25, 2021Updated 4 years ago
- ☆14Oct 10, 2022Updated 3 years ago
- ☆68Mar 4, 2020Updated 6 years ago
- A library for efficient patching and automatic circuit discovery.☆95Dec 31, 2025Updated 3 months ago
- Code for "Supermasks in Superposition"☆126Oct 3, 2023Updated 2 years ago
- experiments for Course 'Advanced Operation System'.☆13Mar 19, 2019Updated 7 years ago
- GenMOS - A System for Generalized (off-the-shelf) 3D Multi-Object Search | ICRA 2023☆23May 17, 2023Updated 2 years ago
- PyTorch utilities for ML, specifically speech☆13Jan 30, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Guidebook for IC PhD life at EPFL☆20Mar 2, 2026Updated last month
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆17Apr 25, 2021Updated 4 years ago
- ☆13Oct 30, 2024Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- (ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…☆44Sep 18, 2020Updated 5 years ago