allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆25Jul 18, 2024Updated last year
Alternatives and similar repositories for noncompliance
Users who are interested in noncompliance are comparing it to the libraries listed below.
- ☆15Jul 9, 2025Updated 7 months ago
- ☆19Oct 2, 2023Updated 2 years ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆24Jul 22, 2024Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 8 months ago
- ☆11Jun 21, 2025Updated 7 months ago
- ☆31Oct 2, 2024Updated last year
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types -- Supplementary inf…☆12Jul 14, 2020Updated 5 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆24Oct 7, 2025Updated 4 months ago
- ☆31Nov 18, 2025Updated 2 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models☆19May 24, 2025Updated 8 months ago
- ☆17Apr 9, 2025Updated 10 months ago
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF"☆18Oct 11, 2024Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆18Dec 23, 2024Updated last year
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- A holistic benchmark for LLM abstention☆69Aug 27, 2025Updated 5 months ago
- R3: Robust Rubric-Agnostic Reward Models☆20Jul 12, 2025Updated 7 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 8 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- Official repository of DialSim☆28Oct 31, 2025Updated 3 months ago
- ☆28Apr 22, 2025Updated 9 months ago
- ☆16Jul 23, 2024Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- ☆21May 3, 2025Updated 9 months ago
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆34Aug 28, 2025Updated 5 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆90Jan 29, 2024Updated 2 years ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆104Jan 14, 2026Updated last month
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- ☆22Sep 2, 2025Updated 5 months ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆127Feb 24, 2025Updated 11 months ago
- ☆25Nov 19, 2025Updated 2 months ago
- ☆21May 24, 2024Updated last year
- Evaluating the faithfulness of long-context language models☆30Oct 21, 2024Updated last year
- Vocabulary Parallelism☆25Mar 10, 2025Updated 11 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year