Junjie-Ye / MulDimIFLinks
A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
☆14Updated last month
Alternatives and similar repositories for MulDimIF
Users that are interested in MulDimIF are comparing it to the libraries listed below
Sorting:
- ☆16Updated 11 months ago
- Control LLM☆17Updated 3 months ago
- Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:…☆25Updated 2 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Codebase for Instruction Following without Instruction Tuning☆35Updated 9 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 9 months ago
- ☆16Updated 2 weeks ago
- ☆14Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Updated 2 years ago
- ☆20Updated 8 months ago
- ☆20Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 9 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆10Updated 6 months ago
- A comprehensive and efficient long-context model evaluation framework☆15Updated this week
- ☆27Updated 2 years ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 5 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆26Updated 4 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆26Updated 4 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated 2 weeks ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆22Updated 4 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆22Updated last year
- ☆13Updated 2 weeks ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆28Updated 3 weeks ago
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models☆11Updated 2 years ago
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- ☆15Updated 8 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆25Updated 3 months ago
- Generating Information-Seeking Conversations from Unlabeled Documents (EMNLP 2022).☆11Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆20Updated last year