datamllab / awsome-LLM-generated-text-detection
☆25 Updated last year
Related projects
Alternatives and complementary repositories for awsome-LLM-generated-text-detection
- ☆12 Updated last year
- Official code repository for Mixset. ☆21 Updated 3 weeks ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs. ☆43 Updated last year
- Can AI-Generated Text be Reliably Detected? ☆62 Updated last year
- LLMDet is a text detection tool that can identify which source generated a text (e.g. a large language model or a human writer). ☆50 Updated 5 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆60 Updated 8 months ago
- Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer" ☆41 Updated 2 years ago
- SeqXGPT: An advanced method for sentence-level AI-generated text detection. ☆76 Updated last year
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models" ☆46 Updated last year
- A comprehensive overview of affective computing research in the era of large language models (LLMs). ☆16 Updated 3 months ago
- Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin… ☆85 Updated last week
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St… ☆19 Updated 6 months ago
- ☆32 Updated 6 months ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models" ☆122 Updated last year
- The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?" ☆52 Updated last week
- Interpretable unified language safety checking with large language models ☆30 Updated last year
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024) ☆61 Updated last month
- This repository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluatio… ☆77 Updated 11 months ago
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection ☆33 Updated 11 months ago
- ☆24 Updated 11 months ago
- Can We Trust Large Language Models?: A Benchmark for Responsible Large Language Models via Toxicity, Bias, and Value-alignment Evaluation ☆24 Updated last year
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid… ☆23 Updated last year
- [NAACL 2022] "SemAttack: Natural Textual Attacks via Different Semantic Spaces" by Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li ☆19 Updated 2 years ago
- ☆19 Updated last year
- Official code implementation of SKU, Accepted by ACL 2024 Findings ☆11 Updated 6 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆33 Updated this week
- ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces" ☆29 Updated last month
- ☆22 Updated 2 years ago
- Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c… ☆20 Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences" ☆19 Updated last month