WangRongsheng / LLM-DetectorLinks
This is an official implementation of paper "LLM-Detector: Improving AI-Generated Chinese Text Detection with Open-Source LLM Instruction Tuning"
☆13Updated last year
Alternatives and similar repositories for LLM-Detector
Users that are interested in LLM-Detector are comparing it to the libraries listed below
Sorting:
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆50Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆34Updated 5 months ago
- [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform" [ACL 2025 Findings] "C2LEVA: Toward Comprehensive and Contaminatio…☆63Updated last month
- ☆97Updated last year
- ☆36Updated 9 months ago
- ☆47Updated 9 months ago
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆23Updated last year
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆50Updated 2 years ago
- A framework for editing the CoTs for better factuality☆50Updated last year
- ☆40Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 2 weeks ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆80Updated last year
- ☆56Updated 7 months ago
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆66Updated last year
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆18Updated 8 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆81Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆31Updated last year
- Data and baseline code of EMNLP 2021 paper "MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset".☆26Updated 3 years ago
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆101Updated 2 years ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- repository for CharacterChat, a personalized social support system☆72Updated 11 months ago
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks☆92Updated last year
- Plug-and-Play Document Modules for Pre-trained Models☆26Updated 2 years ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 11 months ago
- ☆21Updated 2 years ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆74Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆102Updated last year