noranta4 / ASIF
Personal implementation of ASIF by Antonio Norelli
☆25 Updated 10 months ago
Alternatives and similar repositories for ASIF:
Users who are interested in ASIF are comparing it to the repositories listed below:
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆49 Updated 10 months ago
- Official repo for the paper "Weight-based Decomposition: A Case for Bilinear MLPs" ☆20 Updated 4 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- Official PyTorch implementation for NeurIPS'24 paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆19 Updated last month
- Latest Weight Averaging (NeurIPS HITY 2022) ☆29 Updated last year
- Implementation of Bitune: Bidirectional Instruction-Tuning ☆19 Updated 9 months ago
- Recycling diverse models ☆44 Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆95 Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆63 Updated 6 months ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023] ☆86 Updated last year
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations" ☆40 Updated 2 years ago
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo… ☆40 Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat… ☆56 Updated last year
- Deep Networks Grok All the Time and Here is Why ☆31 Updated 10 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models". ☆38 Updated 4 months ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022 ☆62 Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆38 Updated 5 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆54 Updated 3 months ago
- Holistic evaluation of multimodal foundation models ☆43 Updated 7 months ago
- Official code for the paper: "Metadata Archaeology" ☆19 Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆29 Updated 2 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated 11 months ago