ahnjaewoo / timechara
π§π»Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models"
β18Updated 2 months ago
Related projects β
Alternatives and complementary repositories for timechara
- πΈ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"β21Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Modelsβ76Updated 2 months ago
- π μμΈλ μ»΄ν¨ν°κ³΅νλΆ (컴곡) νμ λ Όλ¬Έ ν νλ¦Ώ | Thesis template for SNU CSEβ10Updated last year
- β26Updated last year
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"β11Updated 5 months ago
- Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024)β29Updated 5 months ago
- Repository (preliminary codes) for DSTC10 SIMMC track.β19Updated last year
- β23Updated 11 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image featβ¦β43Updated 3 months ago
- [EMNLP 2023 Findings] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Promptβ20Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Modelsβ66Updated 6 months ago
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewardsβ44Updated 6 months ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"β32Updated 2 months ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)β12Updated last year
- β74Updated last year
- [ICLR 2022] Towards Continual Knowledge Learning of Language Modelsβ93Updated 2 years ago
- β11Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.β57Updated 2 weeks ago
- β15Updated 2 years ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.β31Updated 4 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)β15Updated last year
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.β18Updated last year
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)β21Updated 4 months ago
- The git repository of Modular Prompted Chatbot paperβ33Updated last year
- β24Updated last year
- β20Updated 4 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.β10Updated 8 months ago
- Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024)β21Updated 6 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Othersβ26Updated last month
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β51Updated 5 months ago