princeton-pli / MeCoView external linksLinks
Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆49Jun 30, 2025Updated 7 months ago
Alternatives and similar repositories for MeCo
Users that are interested in MeCo are comparing it to the libraries listed below
Sorting:
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆34Jan 20, 2026Updated 3 weeks ago
- ☆43Oct 13, 2023Updated 2 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding"☆10May 31, 2019Updated 6 years ago
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆11Apr 23, 2023Updated 2 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- Keras Implementation of DDPG(Deep Deterministic Policy Gradient) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆13Mar 25, 2023Updated 2 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- ☆220Oct 27, 2025Updated 3 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Jan 16, 2024Updated 2 years ago
- ☆71Oct 16, 2024Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆35Oct 26, 2025Updated 3 months ago
- SuperCLUE高考作文机器自动阅卷系统☆17Jun 8, 2023Updated 2 years ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆16Dec 22, 2024Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard'☆20Jul 9, 2024Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 2 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Apr 25, 2021Updated 4 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆20Feb 9, 2024Updated 2 years ago
- ☆18May 15, 2021Updated 4 years ago
- ☆33Jun 3, 2025Updated 8 months ago
- Official Code Repository for the paper "Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation …☆20Jun 19, 2023Updated 2 years ago
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- ☆48Jun 8, 2020Updated 5 years ago
- ☆53Apr 29, 2020Updated 5 years ago
- Official repository of the R2-D2's pipeline☆21Nov 16, 2021Updated 4 years ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆60Mar 4, 2025Updated 11 months ago
- Reproducible Language Agent Research☆33Jun 25, 2025Updated 7 months ago
- ☆24Jul 24, 2023Updated 2 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Aug 18, 2024Updated last year
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- 🏃 hosting nlp models in one line☆20May 8, 2024Updated last year
- ☆23Mar 31, 2023Updated 2 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- ☆21May 5, 2020Updated 5 years ago
- ☆26Dec 14, 2023Updated 2 years ago