Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆50Jun 30, 2025Updated 8 months ago
Alternatives and similar repositories for MeCo
Users that are interested in MeCo are comparing it to the libraries listed below
Sorting:
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆36Jan 20, 2026Updated last month
- ☆43Oct 13, 2023Updated 2 years ago
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding"☆10May 31, 2019Updated 6 years ago
- ☆11Apr 23, 2023Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- ☆71Oct 16, 2024Updated last year
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Apr 25, 2021Updated 4 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 2 years ago
- Code and data from the paper 'Human Feedback is not Gold Standard'☆20Feb 24, 2026Updated last week
- SuperCLUE高考作文机器自动阅卷系统☆17Jun 8, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- ☆18May 15, 2021Updated 4 years ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆20Feb 23, 2026Updated last week
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- Official Code Repository for the paper "Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation …☆20Jun 19, 2023Updated 2 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- ☆35Jun 3, 2025Updated 9 months ago
- ☆48Jun 8, 2020Updated 5 years ago
- ☆53Apr 29, 2020Updated 5 years ago
- Reproducible Language Agent Research☆34Jun 25, 2025Updated 8 months ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆31Aug 18, 2024Updated last year
- ☆21May 5, 2020Updated 5 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆78May 2, 2025Updated 10 months ago
- Influence Estimation for Gradient-Boosted Decision Trees☆29May 27, 2024Updated last year
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆24Jun 19, 2023Updated 2 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated 2 months ago
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering☆68Nov 26, 2021Updated 4 years ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 8 months ago