baoguangsheng / glimpseLinks
Code base for ICLR 2025 "Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection"
☆39Updated 3 weeks ago
Alternatives and similar repositories for glimpse
Users that are interested in glimpse are comparing it to the libraries listed below
Sorting:
- Code base for ICLR 2024 "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".☆329Updated this week
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆87Updated 3 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆158Updated 5 months ago
- ☆29Updated last year
- Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.☆224Updated 3 months ago
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆35Updated 6 months ago
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models☆37Updated last year
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆59Updated last year
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆81Updated last month
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆92Updated last year
- The lastest paper about detection of LLM-generated text and code☆276Updated 2 months ago
- A framework for editing the CoTs for better factuality☆51Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆231Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆134Updated 11 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆76Updated 9 months ago
- ☆157Updated 11 months ago
- ☆20Updated last year
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆106Updated last year
- Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]☆245Updated last month
- (NAACL 2024) Official code repository for Mixset.☆26Updated 9 months ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆23Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆226Updated 8 months ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆89Updated 3 months ago
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially …☆44Updated 5 months ago
- Benchmarking LLMs' Psychological Portrayal☆123Updated 8 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆49Updated last year
- Source code of our paper MIND, ACL 2024 Long Paper☆50Updated last year
- LLM Unlearning☆174Updated last year
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated last year
- LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).☆77Updated last year