mlfoundations / dataset2metadata
☆21Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for dataset2metadata
- Code for T-MARS data filtering☆35Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆30Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆26Updated 11 months ago
- Data Valuation on In-Context Examples (ACL23)☆23Updated last month
- Code for "Merging Text Transformers from Different Initializations"☆19Updated 3 months ago
- PyTorch building blocks for OLMo☆18Updated this week
- Influence Experiments☆35Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- Tasks for describing differences between text distributions.☆16Updated 3 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆44Updated last year
- ☆18Updated last month
- Repository for Skill Set Optimization☆12Updated 3 months ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆37Updated last year
- Parkar and Kim et al.'s paper on Can LLMs Select Important Instructions to Annotate?"☆12Updated 4 months ago
- ☆18Updated 3 months ago
- ☆13Updated last year
- ☆25Updated 4 months ago
- ☆33Updated 7 months ago
- ☆29Updated 2 years ago
- ☆40Updated 2 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆22Updated last year
- ☆15Updated 4 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆18Updated 2 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆69Updated last year
- ☆18Updated 5 months ago
- Code for paper 'Data-Efficient FineTuning'☆29Updated last year
- ☆71Updated 6 months ago