Junjie-Ye / MulDimIF
A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
☆12Updated 2 weeks ago
Alternatives and similar repositories for MulDimIF
Users who are interested in MulDimIF are comparing it to the libraries listed below.
- ☆16Updated 10 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 8 months ago
- Control LLM☆14Updated 2 months ago
- ☆20Updated 7 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- ☆14Updated 3 weeks ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆23Updated 3 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆21Updated 3 months ago
- ☆14Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 7 months ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆10Updated 4 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 4 months ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Updated 2 years ago
- Official repo of the AAAI 2024 paper "Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization"☆13Updated last year
- ☆10Updated 3 years ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 2 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆22Updated this week
- The repository contains code for Adaptive Data Optimization☆24Updated 5 months ago
- Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:…☆25Updated last month
- Code for "RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆20Updated 2 months ago
- Applies Iprompt to GLM with new methods. Currently supports Chinese QA, English QA, and Chinese poem generation.☆20Updated 2 years ago
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Updated 3 years ago
- ☆17Updated 2 months ago
- Self-Supervised Alignment with Mutual Information☆19Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.☆31Updated 6 months ago
- Unsupervised GRPO☆24Updated last week
- A package for fine-tuning pretrained NLP transformers using semi-supervised learning☆14Updated 3 years ago