ahnjaewoo / timechara
Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models"
★18 · Updated last month
Related projects
Alternatives and complementary repositories for timechara
- Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation" ★21 · Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models ★75 · Updated last month
- Seoul National University Department of Computer Science and Engineering (SNU CSE) thesis template ★10 · Updated last year
- ★26 · Updated last year
- Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024) ★29 · Updated 5 months ago
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle" ★11 · Updated 5 months ago
- ★74 · Updated last year
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models ★93 · Updated 2 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ★51 · Updated this week
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23) ★12 · Updated last year
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs" ★32 · Updated last month
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024. ★31 · Updated 4 months ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe) ★21 · Updated 4 months ago
- ★16 · Updated last year
- Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024) ★21 · Updated 6 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models ★47 · Updated 11 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity ★38 · Updated 9 months ago
- ★22 · Updated 8 months ago
- [EMNLP 2023 Findings] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt ★20 · Updated last year
- Official code for the paper: Evaluating Copyright Takedown Methods for Language Models ★15 · Updated 3 months ago
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts ★14 · Updated 2 months ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models ★66 · Updated 5 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat… ★42 · Updated 2 months ago
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ★21 · Updated 2 weeks ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar… ★101 · Updated 9 months ago
- ★13 · Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ★62 · Updated 2 years ago
- Active Example Selection for In-Context Learning (EMNLP'22) ★45 · Updated 3 months ago
- Repository (preliminary codes) for DSTC10 SIMMC track. ★19 · Updated last year
- GPT as Human ★18 · Updated 9 months ago