The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions
☆72Mar 26, 2023Updated 2 years ago
Alternatives and similar repositories for D5
Users that are interested in D5 are comparing it to the libraries listed below
Sorting:
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Feb 24, 2023Updated 3 years ago
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"☆40May 24, 2023Updated 2 years ago
- Augmenting Statistical Models with Natural Language Parameters☆28Sep 17, 2024Updated last year
- The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (A…☆13Jul 16, 2024Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- ☆10Mar 1, 2025Updated last year
- Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.☆41Mar 2, 2026Updated 2 weeks ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- ☆21Jan 15, 2024Updated 2 years ago
- [ACL2025 Best Paper] Language Models Resist Alignment☆44Jun 11, 2025Updated 9 months ago
- ☆29Nov 16, 2025Updated 4 months ago
- AbstainQA, ACL 2024☆29Feb 4, 2026Updated last month
- Data and Code for StructuredRegex.☆15Nov 16, 2023Updated 2 years ago
- The information of NLP PhD application in the world.☆37Aug 27, 2024Updated last year
- ☆25May 16, 2024Updated last year
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 2 years ago
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 8 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆184Oct 28, 2022Updated 3 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- ☆25May 23, 2022Updated 3 years ago
- ☆17Feb 20, 2023Updated 3 years ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- ☆95Dec 19, 2024Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Mar 7, 2025Updated last year
- The official repository of the paper "On the Exploitability of Instruction Tuning".☆70Feb 5, 2024Updated 2 years ago
- ☆64Feb 4, 2024Updated 2 years ago
- ☆13Oct 7, 2024Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆35Aug 21, 2025Updated 6 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆25Jun 17, 2025Updated 9 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆243May 5, 2024Updated last year
- A library for mechanistic anomaly detection☆22Jan 9, 2025Updated last year
- ☆68Jun 27, 2022Updated 3 years ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆236Aug 2, 2024Updated last year
- ☆18Mar 23, 2025Updated 11 months ago
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year