fredzzhang / atlas
Official PyTorch implementation for NeurIPS'24 paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"
☆11Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for atlas
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆11Updated last year
- Official implementation for Sparse MetA-Tuning (SMAT)☆14Updated 4 months ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆28Updated last year
- ☆28Updated last year
- ☆26Updated 2 years ago
- ☆22Updated this week
- Code for "Merging Text Transformers from Different Initializations"☆19Updated 3 months ago
- Recycling diverse models☆44Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆13Updated 7 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆44Updated 5 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆36Updated last year
- Official repo of Progressive Data Expansion: data, code and evaluation☆27Updated last year
- Gradient-based Hyperparameter Optimization Over Long Horizons☆12Updated 3 years ago
- Official code for the paper "Attention as a Hypernetwork"☆23Updated 5 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆12Updated 2 months ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆20Updated last year
- ☆15Updated 4 months ago
- Official implementation of the paper "Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Perform…☆19Updated last year
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated this week
- ☆17Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"☆14Updated 3 weeks ago
- ☆14Updated 11 months ago
- An adaptive training algorithm for residual network☆14Updated 4 years ago
- Structured Pruning Adapters in PyTorch☆15Updated last year
- Code for T-MARS data filtering☆35Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆24Updated 7 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- ☆15Updated 2 weeks ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 7 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago