Social-AI-Studio / ToxiCloakCNLinks
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
☆43Updated last year
Alternatives and similar repositories for ToxiCloakCN
Users that are interested in ToxiCloakCN are comparing it to the libraries listed below
Sorting:
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆61Updated 7 months ago
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…☆97Updated 6 months ago
- ☆177Updated last year
- This is a repository dedicated to high quality figures from ACL 2025 long papers.☆130Updated this week
- ☆32Updated last year
- 大模型进阶面经☆87Updated 7 months ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆21Updated last year
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆115Updated 8 months ago
- The lastest paper about detection of LLM-generated text and code☆280Updated 6 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆302Updated 2 years ago
- SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for…☆87Updated last year
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆55Updated 7 months ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆99Updated 2 years ago
- ☆96Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆138Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark☆42Updated 3 months ago
- ☆35Updated last year
- ☆137Updated 9 months ago
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation☆56Updated 11 months ago
- ☆27Updated 2 years ago
- Code for paper 'Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning'☆18Updated last year
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆90Updated last year
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆21Updated 7 months ago
- The code implementation of the paper CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Low Resource With Contrastive Learni…☆16Updated last year
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆87Updated 4 years ago
- ☆171Updated 3 weeks ago
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆64Updated 6 months ago
- [ACL2025] STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection☆39Updated last month