Social-AI-Studio / ToxiCloakCNLinks
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
☆43Updated last year
Alternatives and similar repositories for ToxiCloakCN
Users that are interested in ToxiCloakCN are comparing it to the libraries listed below
Sorting:
- ☆179Updated last year
- ☆133Updated 2 weeks ago
- This is a repository dedicated to high quality figures from ACL 2025 long papers.☆135Updated 3 weeks ago
- ☆33Updated last year
- ☆180Updated last month
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆65Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- Implementation of "REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering"☆34Updated last year
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆121Updated 8 months ago
- website repo for agent-based social movement simulation☆27Updated last year
- ☆98Updated 6 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- 大模型进阶面经☆93Updated 8 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆139Updated last year
- The lastest paper about detection of LLM-generated text and code☆282Updated 6 months ago
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…☆101Updated 7 months ago
- SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for…☆89Updated last year
- [ACL 2025] Removal of Hallucination on Hallucination: Debate-Augmented RAG☆33Updated 5 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆382Updated 2 months ago
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation☆56Updated last year
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆57Updated 8 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆62Updated last year
- Code base for ICLR 2024 "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".☆370Updated 3 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆63Updated 2 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆89Updated 11 months ago
- Code and data for "The Power of Noise: Redefining Retrieval for RAG Systems"☆69Updated 6 months ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆21Updated last year
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆100Updated 2 years ago
- A Collection of Papers about Memory for Language Agents☆245Updated 3 weeks ago
- A collection of resources that investigate social agents.☆212Updated 8 months ago