noranta4 / ASIFLinks
Personal implementation of ASIF by Antonio Norelli
☆25Updated last year
Alternatives and similar repositories for ASIF
Users that are interested in ASIF are comparing it to the libraries listed below
Sorting:
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated 2 years ago
- ☆51Updated last year
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆65Updated 2 years ago
- Recycling diverse models☆45Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- ☆45Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Official repo for the paper "Weight-based Decomposition: A Case for Bilinear MLPs"☆22Updated 7 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆54Updated last year
- ☆107Updated last year
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc…☆151Updated 2 years ago
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆49Updated 9 months ago
- 👋 Overcomplete is a Vision-based SAE Toolbox☆67Updated 3 months ago
- 👋 Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)☆65Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆37Updated 2 years ago
- ☆99Updated 5 months ago
- A centralized place for deep thinking code and experiments☆85Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 9 months ago
- ☆53Updated 9 months ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆90Updated last year
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆89Updated last year
- simple bibtex generator for any text with \cite{}☆31Updated last year
- Codebase for Mechanistic Mode Connectivity☆15Updated 2 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆27Updated last year
- ☆22Updated 6 months ago
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo…☆41Updated 2 years ago
- ☆31Updated last year