mlfoundations / dataset2metadata
☆20 Updated 5 months ago
Related projects:
- Few-shot Learning with Auxiliary Data ☆26 Updated 9 months ago
- ☆40 Updated 2 years ago
- ☆33 Updated 5 months ago
- Influence Experiments ☆36 Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data" ☆87 Updated last year
- Code for "Merging Text Transformers from Different Initializations" ☆18 Updated last month
- Code for T-MARS data filtering ☆34 Updated last year
- Adding new tasks to T0 without catastrophic forgetting ☆30 Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 2 years ago
- ☆28 Updated last year
- ☆14 Updated 6 months ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆50 Updated last year
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆52 Updated 3 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation ☆26 Updated 10 months ago
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?" ☆11 Updated 2 months ago
- Tasks for describing differences between text distributions. ☆15 Updated last month
- ☆15 Updated 2 months ago
- Official Repository for Dataset Inference for LLMs ☆21 Updated last month
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆15 Updated 10 months ago
- ☆22 Updated last year
- ☆17 Updated this week
- Data Valuation on In-Context Examples (ACL23) ☆22 Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆68 Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models ☆42 Updated last week
- Using FlexAttention to compute attention with different masking patterns ☆28 Updated last week
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆36 Updated 11 months ago
- Code for paper 'Data-Efficient FineTuning' ☆29 Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆77 Updated last year
- This repository contains data, code and models for contextual noncompliance. ☆17 Updated 2 months ago
- ☆14 Updated last month