krafton-ai / mini-batch-cl
☆11Updated last year
Alternatives and similar repositories for mini-batch-cl:
Users that are interested in mini-batch-cl are comparing it to the libraries listed below
- ☆16Updated last year
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023)☆20Updated last year
- Domain Adaptation and Adapters☆16Updated 2 years ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆29Updated last year
- ☆66Updated 3 years ago
- ☆20Updated last year
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Updated 6 months ago
- ☆128Updated 2 years ago
- The git repository of Modular Prompted Chatbot paper☆33Updated last year
- Model Stock: All we need is just a few fine-tuned models☆113Updated 7 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆27Updated 11 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆25Updated 5 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆12Updated 10 months ago
- ☆34Updated last week
- KAIST AI605 Deep Learning for NLP☆31Updated 2 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆38Updated last year
- ☆9Updated last month
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆65Updated 7 months ago
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…☆17Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention☆67Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- "Learning Loss for Test-Time Augmentation (NeurIPS 2020)"☆9Updated 4 years ago
- ☆22Updated 10 months ago
- ☆29Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated last year
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>☆61Updated last year
- Position Prediction as an Effective Pretraining Strategy☆8Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- ☆30Updated 9 months ago