allenai/easy-to-hard-generalization

Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"

☆48

Alternatives and similar repositories for easy-to-hard-generalization

Users who are interested in easy-to-hard-generalization are comparing it to the repositories listed below.

Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17 · Jan 26, 2024 · Updated 2 years ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆28 · Sep 17, 2024 · Updated last year
keven980716 / weak-to-strong-deception
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15 · Jun 21, 2024 · Updated 2 years ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆102 · Apr 2, 2024 · Updated 2 years ago
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
shentianxiao / FiLM
☆13 · Oct 18, 2023 · Updated 2 years ago
xbmxb / EnvDistraction
☆24 · Oct 11, 2024 · Updated last year
xinghaow99 / DenoSent
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15 · Apr 29, 2024 · Updated 2 years ago
facebookresearch / iclmlp
Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"
☆18 · May 29, 2023 · Updated 3 years ago
choidami / inductive-oocr
☆16 · Mar 22, 2025 · Updated last year
ducdauge / sft-llm
Scaling Sparse Fine-Tuning to Large Language Models
☆19 · Jan 31, 2024 · Updated 2 years ago
mlepori1 / NeuroSurgeon
NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers
☆43 · Feb 12, 2025 · Updated last year
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75 · May 20, 2025 · Updated last year
hexuandeng / Mono4SiMT
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
☆12 · Jul 19, 2023 · Updated 3 years ago
GAIR-NLP / OPO
☆50 · Mar 2, 2024 · Updated 2 years ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59 · Oct 29, 2023 · Updated 2 years ago
liziniu / ReMax
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
☆202 · Dec 16, 2023 · Updated 2 years ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36 · Sep 24, 2024 · Updated last year
lyan62 / FoodieQA
Official Repo for FoodieQA paper (EMNLP 2024)
☆20 · Jun 26, 2025 · Updated last year
tangzhy / RealCritic
☆15 · Jan 27, 2025 · Updated last year
Tomiinek / Aargh
☆12 · Jan 2, 2024 · Updated 2 years ago
UKPLab / incorporating-relevance
Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…
☆14 · Mar 30, 2026 · Updated 3 months ago
yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆31 · Mar 2, 2024 · Updated 2 years ago
dannyallover / overthinking_the_truth
☆29 · Apr 30, 2024 · Updated 2 years ago
ruiqi-zhong / DescribeDistributionalDifferences
Code for preprint: Summarizing Differences between Text Distributions with Natural Language
☆43 · Feb 24, 2023 · Updated 3 years ago
hbin0701 / Self-Explore
[EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin…
☆52 · May 4, 2024 · Updated 2 years ago
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆12 · Jul 22, 2024 · Updated 2 years ago
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆471 · Apr 18, 2024 · Updated 2 years ago
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,411 · Apr 11, 2024 · Updated 2 years ago
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146 · Oct 27, 2024 · Updated last year
jiangycTarheel / SQ-Transformer
☆10 · Feb 12, 2024 · Updated 2 years ago
wxjiao / InstructMT
A collection of instruction data and scripts for machine translation.
☆20 · Sep 23, 2023 · Updated 2 years ago
ruiqi-zhong / D5
The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions
☆72 · Mar 26, 2023 · Updated 3 years ago
Linear95 / DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25 · Mar 7, 2024 · Updated 2 years ago
psunlpgroup / ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆32 · Aug 18, 2024 · Updated last year
zhiyuanhubj / LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
☆79 · Oct 16, 2024 · Updated last year
yiqingxyq / RepoST
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24 · Mar 18, 2025 · Updated last year
cooelf / CompassMTL
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22 · Oct 17, 2022 · Updated 3 years ago
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆124 · Sep 9, 2024 · Updated last year