hamishivi/automated-instruction-selection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hamishivi/automated-instruction-selection)

hamishivi / automated-instruction-selection

Exploration of automated dataset selection approaches at large scales.

☆55

Alternatives and similar repositories for automated-instruction-selection

Users that are interested in automated-instruction-selection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / hybrid-preferences
View on GitHub
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29Jul 23, 2025Updated last year
alon-albalak / online-data-mixing
View on GitHub
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14Jan 9, 2024Updated 2 years ago
chentong0 / rl-binary-rar
View on GitHub
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15Nov 13, 2025Updated 8 months ago
zhuang-li / SCAR
View on GitHub
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Mode…
☆39Aug 6, 2025Updated 11 months ago
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29Oct 14, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JiaQiSJTU / FaithEval-FFLM
View on GitHub
A zero-shot faithfulness evaluation metric for text summarization
☆11Oct 17, 2023Updated 2 years ago
Jiachen-T-Wang / GREATS
View on GitHub
☆20Jun 27, 2026Updated last month
ZhentingWang / DUMP
View on GitHub
☆33May 9, 2025Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
SeyedAlirezaFatemi / decaf-compiler
View on GitHub
A compiler for decaf programming language written in python using Lark. Target language is MIPS.
☆12Aug 10, 2020Updated 5 years ago
sunblaze-ucb / reasoning_ladder
View on GitHub
☆35May 16, 2025Updated last year
IST-DASLab / sparseprop
View on GitHub
☆16Sep 27, 2023Updated 2 years ago
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Mangeneh / akkaskhooneh-frontend
View on GitHub
Akkaskhooneh is a social media application which provides some features of Instagram and some features of Pinterest.
☆12Nov 14, 2018Updated 7 years ago
IST-DASLab / peft-rosa
View on GitHub
A fork of the PEFT library, supporting Robust Adaptation (RoSA)
☆15Aug 16, 2024Updated last year
upiterbarg / diff_history
View on GitHub
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
☆20Aug 20, 2024Updated last year
upiterbarg / lintseq
View on GitHub
[ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)
☆19Feb 11, 2025Updated last year
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 7 months ago
GAIR-NLP / LIMR
View on GitHub
☆221Feb 20, 2025Updated last year
yichengchen24 / MIG
View on GitHub
[ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Spac…
☆28Aug 30, 2025Updated 10 months ago
r-three / AttriBoT
View on GitHub
Code for AttriBoT from "AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution"
☆15Apr 21, 2025Updated last year
anpaure / cp_eval
View on GitHub
Tiny evaluation of leading LLMs on competitive programming problems
☆14Apr 10, 2026Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
TIGER-AI-Lab / CritiqueFineTuning
View on GitHub
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆182Jul 8, 2025Updated last year
stellalisy / alfa
View on GitHub
Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
☆18Feb 21, 2025Updated last year
vuminhduc796 / GPTVoiceTasker
View on GitHub
☆12Apr 2, 2024Updated 2 years ago
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33Jan 23, 2025Updated last year
clinicalml / cotrain-prompting
View on GitHub
Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance
☆16Sep 23, 2022Updated 3 years ago
epfml / schedules-and-scaling
View on GitHub
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93Oct 30, 2024Updated last year
RulinShao / massive-serve
View on GitHub
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆26Jun 6, 2025Updated last year
InternLM / OREAL
View on GitHub
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆190Mar 20, 2025Updated last year
sail-sg / sdft
View on GitHub
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆167Nov 2, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
anadim / llm-benchmark-matrix
View on GitHub
Cited 83-model x 49-benchmark LLM evaluation matrix with 18 matrix completion methods
☆39Feb 25, 2026Updated 5 months ago
nuprl / MultiPL-T
View on GitHub
Knowledge transfer from high-resource to low-resource programming languages for Code LLMs
☆17Aug 12, 2025Updated 11 months ago
belindal / TaskBench500
View on GitHub
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Jul 16, 2022Updated 4 years ago
JTWang2000 / NICE
View on GitHub
NICE: Non-differentiable evaluation metric-based InfluenCe Estimation
☆16Jul 7, 2025Updated last year
YJiangcm / BMC
View on GitHub
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12Jan 26, 2025Updated last year
paul-rottger / issuebench
View on GitHub
Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"
☆17Mar 6, 2026Updated 4 months ago