Junjie-Ye / MulDimIF

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

☆14

Alternatives and similar repositories for MulDimIF

Users who are interested in MulDimIF are comparing it to the repositories listed below.

yale-nlp / refdpo
☆16 Updated 11 months ago
linkedin / ControlLLM
Control LLM
☆17 Updated 3 months ago
general-preference / general-preference-model
Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:…
☆25 Updated 2 months ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35 Updated 9 months ago
duykhuongnguyen / LASeR-MAB
Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"
☆13 Updated 9 months ago
Leezekun / MacRAG
☆16 Updated 2 weeks ago
googleinterns / localizing-paragraph-memorization
☆14 Updated last year
renll / SparseLT
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14 Updated 2 years ago
yifeiwang77 / Self-Correction
☆20 Updated 8 months ago
formll / resolving-scaling-law-discrepancies
☆20 Updated last year
sanyalsunny111 / Early_Weight_Avg
[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training
☆16 Updated 9 months ago
UCSB-NLP-Chang / Prereq_tune
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆10 Updated 6 months ago
LCM-Lab / LOOM-Scope
A comprehensive and efficient long-context model evaluation framework
☆15 Updated this week
ctlllll / reward_collapse
☆27 Updated 2 years ago
YJiangcm / BMC
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12 Updated 5 months ago
yihedeng9 / DuoGuard
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
☆26 Updated 4 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆26 Updated 4 months ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30 Updated 2 weeks ago
THUDM / DataSciBench
DataSciBench: An LLM Agent Benchmark for Data Science
☆22 Updated 4 months ago
maszhongming / ParaKnowTransfer
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆32 Updated last year
kaiyuhwang / MLLM-Survey
A paper list on multilingual pre-trained models (continually updated).
☆22 Updated last year
Hritikbansal / jpo
☆13 Updated 2 weeks ago
wlzhang2020 / ReasonRAG
Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆28 Updated 3 weeks ago
jingtaozhan / extrapolate-eval
CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models
☆11 Updated 2 years ago
wangskyGit / passage-sieve
Official repo of the AAAI 2024 paper "Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization"
☆13 Updated last year
wutaiqiang / LLM_KD_AKL
☆15 Updated 8 months ago
OSU-NLP-Group / In-Context-Reranking
[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
☆25 Updated 3 months ago
naver-ai / simseek
Generating Information-Seeking Conversations from Unlabeled Documents (EMNLP 2022).
☆11 Updated 2 years ago
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 Updated last year