Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"
☆17Feb 25, 2025Updated last year
Alternatives and similar repositories for IHEval
Users who are interested in IHEval are comparing it to the repositories listed below
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆28Feb 17, 2025Updated last year
- Mainly on text documents. Implemented a mini search engine using different algorithms, then summarized documents using LexRank.☆11Jan 19, 2018Updated 8 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- ☆13Jan 14, 2026Updated last month
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Jan 10, 2026Updated last month
- Inferring Strange Behavior from Connectivity Pattern (PAKDD 2014, KAIS 2015)☆11Mar 27, 2015Updated 10 years ago
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models☆11Sep 19, 2025Updated 5 months ago
- An automatic differentiation framework built on a low-level matrix implementation, with a DNN and an RNN wrapped on top☆10Dec 3, 2020Updated 5 years ago
- ☆12Sep 22, 2023Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends [NeurIPS 2023]☆10Jan 28, 2024Updated 2 years ago
- This repository is unmaintained, please see lumo for details.☆10Mar 19, 2023Updated 2 years ago
- Multi-Agent Reinforcement Learning☆11Jun 16, 2020Updated 5 years ago
- Python package for InfoAlign☆13Oct 14, 2024Updated last year
- ☆12Apr 12, 2024Updated last year
- Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller☆10Dec 5, 2017Updated 8 years ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆38Feb 21, 2026Updated 2 weeks ago
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.☆13Jul 27, 2025Updated 7 months ago
- Multi-Critic Policy Gradient Optimization for Quadcopter Coordination☆14Aug 10, 2021Updated 4 years ago
- DIgital Musicology Corpus Analysis Toolkit☆14Sep 4, 2025Updated 6 months ago
- A small, useless, self-replicating virus that injects itself into a windows executable binary. Upon execution, it will infect all the oth…☆14Dec 26, 2017Updated 8 years ago
- ☆13Oct 29, 2021Updated 4 years ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Mar 2, 2026Updated last week
- Code of paper: "xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"☆17Feb 17, 2026Updated 3 weeks ago
- ☆11Nov 28, 2025Updated 3 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Oct 1, 2024Updated last year
- ☆25Sep 15, 2025Updated 5 months ago
- Compare how fine-tuned AI video models interpret the same prompts☆14Jan 29, 2025Updated last year
- Crawling Papers in S&P/CCS/USENIX Security/NDSS according to keywords.☆14May 12, 2025Updated 9 months ago
- ☆13Feb 18, 2024Updated 2 years ago
- ☆14Aug 15, 2024Updated last year
- This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)☆11Mar 24, 2018Updated 7 years ago
- Network Intrusion Detection System (Abnormal Detection)☆11Jun 14, 2020Updated 5 years ago
- Source code for ICLR'24 paper "How does unlabeled data provably help OOD detection?"☆13Feb 1, 2024Updated 2 years ago
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆15Sep 23, 2024Updated last year
- [NeurIPS 2023] The official implementation of "Rethinking Semi-Supervised Imbalanced Node Classification from Bias-Variance Decomposition…☆12Oct 25, 2025Updated 4 months ago
- The official code for "An Engorgio Prompt Makes Large Language Model Babble on"☆21Aug 9, 2025Updated 7 months ago
- ☆13Oct 21, 2021Updated 4 years ago
- ☆20Jun 16, 2025Updated 8 months ago