Social-AI-Studio / ToxiCloakCN
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
☆33Updated 3 months ago
Alternatives and similar repositories for ToxiCloakCN:
Users that are interested in ToxiCloakCN are comparing it to the libraries listed below
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆76Updated 11 months ago
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆14Updated 7 months ago
- ☆24Updated last year
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆29Updated 3 weeks ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆79Updated last year
- ☆23Updated 10 months ago
- ☆84Updated 4 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆107Updated 2 months ago
- ☆16Updated last year
- kaggle 2024 Eedi 第10名 金牌方案☆22Updated 3 weeks ago
- A paper list about diffusion models for natural language processing.☆180Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆100Updated 3 months ago
- ☆13Updated 6 months ago
- The source code of paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking"☆70Updated 2 years ago
- [ICLR'24 Spotlight] The official codes of our work on AIGC detection: "Multiscale Positive-Unlabeled Detection of AI-Generated Texts"☆116Updated last year
- 基于DPO算法微调语言大模型,简单好上手。☆30Updated 6 months ago
- ☆80Updated last year
- A collection of survey papers and resources related to Large Language Models (LLMs).☆39Updated 11 months ago
- Released code and datas for「Multi-modal Stance Detection: New Datasets and Model」in ACL2024.☆17Updated 7 months ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆14Updated 5 months ago
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models☆24Updated last year
- The code and data of DPA-RAG☆54Updated 3 months ago
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆44Updated 10 months ago
- 使用 Qwen2ForSequenceClassification 简单实现文本分类任务。☆42Updated 7 months ago
- A repository of useful research/skill-upgrading talks or acticles in NLP/CV/AI Area (in Chinese).☆72Updated 5 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆37Updated 5 months ago
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆73Updated 3 years ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆50Updated 9 months ago
- ☆78Updated last year
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆21Updated 6 months ago