Social-AI-Studio / ToxiCloakCN
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
☆43Updated last year
Alternatives and similar repositories for ToxiCloakCN
Users who are interested in ToxiCloakCN are comparing it to the repositories listed below
- ☆33Updated last year
- ☆179Updated last year
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆65Updated 7 months ago
- This is a repository dedicated to high quality figures from ACL 2025 long papers.☆135Updated 3 weeks ago
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…☆101Updated 7 months ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆21Updated last year
- SeqXGPT: An advanced method for sentence-level AI-generated text detection.☆100Updated 2 years ago
- Advanced interview experiences for large language model roles☆93Updated 8 months ago
- ☆98Updated 6 months ago
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar☆121Updated 8 months ago
- The latest papers on the detection of LLM-generated text and code☆281Updated 6 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆57Updated 8 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆308Updated 2 years ago
- Code base for ICLR 2024 "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".☆369Updated 3 months ago
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆21Updated 7 months ago
- ☆133Updated last week
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆130Updated last year
- ☆62Updated 2 years ago
- ☆27Updated 2 years ago
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆18Updated last year
- A live reading list for LLM data synthesis (Updated to July, 2025).☆435Updated 4 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆62Updated last year
- ☆29Updated last year
- Kaggle 2024 Eedi 10th-place gold-medal solution☆44Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆139Updated last year
- NeurIPS 2025 Poster☆26Updated 11 months ago
- A collection of resources that investigate social agents.☆212Updated 8 months ago
- ☆36Updated last year