thu-ml / LM-Calibration
☆15 Updated last year
Related projects
Alternatives and complementary repositories for LM-Calibration
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models" ☆11 Updated 5 months ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning ☆38 Updated last year
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning ☆29 Updated last year
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ☆11 Updated 11 months ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra… ☆27 Updated 3 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆28 Updated last year
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021 ☆38 Updated 3 years ago
- Code for the paper "Query-Key Normalization for Transformers" ☆35 Updated 3 years ago
- Domain Adaptation and Adapters ☆16 Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings ☆20 Updated last year
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022) ☆29 Updated 2 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch. ☆47 Updated last year
- Official repo of Progressive Data Expansion: data, code and evaluation ☆27 Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆30 Updated 6 months ago
- Tasks for describing differences between text distributions. ☆16 Updated 3 months ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20… ☆27 Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models ☆15 Updated 6 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- Mixture of Attention Heads ☆39 Updated 2 years ago
- ☆13 Updated last year
- The code for lifelong few-shot language learning ☆53 Updated 2 years ago
- ☆29 Updated 2 years ago
- ☆21 Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing ☆15 Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆26 Updated last year
- ☆19 Updated last month
- "Learning Loss for Test-Time Augmentation (NeurIPS 2020)" ☆9 Updated 3 years ago
- [ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations ☆14 Updated last year