Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
β263Updated 4 months ago
Alternatives and similar repositories for Awesome_papers_on_LLMs_detection
Users that are interested in Awesome_papers_on_LLMs_detection are comparing it to the libraries listed below
Sorting:
- UP-TO-DATE LLM Watermark paper. π₯π₯π₯β338Updated 5 months ago
- Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.β219Updated 3 weeks ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, currentβ¦β73Updated 5 months ago
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenariosβ23Updated 5 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, currentβ¦β216Updated 4 months ago
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Surveyβ94Updated 9 months ago
- LLM Unlearningβ157Updated last year
- SeqXGPT: An advance method for sentence-level AI-generated text detection.β90Updated last year
- β48Updated 11 months ago
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"β91Updated 8 months ago
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)β65Updated last week
- γACL 2024γ SALAD benchmark & MD-Judgeβ145Updated 2 months ago
- The code implementation of the paper CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Low Resource With Contrastive Learniβ¦β15Updated last year
- Code for watermarking language modelsβ79Updated 8 months ago
- Accepted by ECCV 2024β129Updated 7 months ago
- β153Updated 3 months ago
- A survey on harmful fine-tuning attack for large language modelβ170Updated last week
- An LLM can Fool Itself: A Prompt-Based Adversarial Attack (ICLR 2024)β85Updated 3 months ago
- β25Updated 10 months ago
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Textβ29Updated last year
- β55Updated 2 months ago
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially β¦β43Updated last month
- Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversariaβ¦β50Updated 2 years ago
- (NAACL 2024) Official code repository for Mixset.β25Updated 5 months ago
- β129Updated 8 months ago
- A resource repository for machine unlearning in large language modelsβ397Updated last month
- TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)β100Updated last week
- Accepted by IJCAI-24 Survey Trackβ202Updated 8 months ago
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritizationβ22Updated 10 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β116Updated 7 months ago