Social-AI-Studio / ToxiCloakCN
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
☆40 Updated 7 months ago
Alternatives and similar repositories for ToxiCloakCN
Users who are interested in ToxiCloakCN are comparing it to the repositories listed below
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"… ☆72 Updated 4 months ago
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar… ☆78 Updated last month
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS 2024 D&B). ☆39 Updated 4 months ago
- Constraint Back-translation Improves Complex Instruction Following of Large Language Models ☆11 Updated 5 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆116 Updated 7 months ago
- An up-to-date curated list of Retrieval-Augmented Generation (RAG) for Large Language Models (LLMs). ☆63 Updated this week
- The code implementation of the paper CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Low Resource With Contrastive Learni… ☆15 Updated last year
- Advanced interview experiences for large language models ☆48 Updated last week
- The code and data of DPA-RAG, accepted by WWW 2025 main conference. ☆61 Updated 3 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models" ☆40 Updated last week
- [ACL 2024] Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs ☆39 Updated 7 months ago
- [ICML 2024] Can AI Assistants Know What They Don't Know? ☆80 Updated last year
- ☆55 Updated 2 months ago
- The source code of the paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking" ☆74 Updated 2 years ago
- ☆48 Updated 11 months ago
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detection ☆12 Updated last week
- ☆16 Updated 10 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs). ☆40 Updated last year
- SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for… ☆65 Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆119 Updated 6 months ago
- ☆29 Updated last week
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization ☆22 Updated 10 months ago
- ☆17 Updated last year
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group. ☆49 Updated 11 months ago
- This is the repo for the survey of Bias and Fairness in IR with LLMs. ☆53 Updated last month
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin… ☆22 Updated 9 months ago
- Website repo for agent-based social movement simulation ☆21 Updated 11 months ago
- ☆23 Updated 6 months ago
- The code of the arXiv paper "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis" ☆25 Updated last week
- FedJudge: Federated Legal Large Language Model ☆33 Updated 7 months ago