liyucheng09/Contamination_Detector

Lightweight tool to identify Data Contamination in LLMs evaluation

☆53

Alternatives and similar repositories for Contamination_Detector

Users who are interested in Contamination_Detector are comparing it to the libraries listed below.

starrYYxuan / LeCo
This is the implementation of LeCo
☆33 · Jan 20, 2025 · Updated last year
pillowsofwind / Course-Correction
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
☆20 · Oct 2, 2024 · Updated last year
JIA-Lab-research / Mr-Ben
This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"
☆51 · Oct 31, 2024 · Updated last year
AlongWY / gpustat
📊 A simple command-line utility for querying and monitoring GPU status
☆14 · Aug 3, 2023 · Updated 2 years ago
AlongWY / pysonic
☆18 · Feb 20, 2026 · Updated 5 months ago
USTC-StarTeam / ZIP
arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.
☆28 · Jun 10, 2026 · Updated last month
yjywdzh / ACE
This repository contains the code for the paper "ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall"
☆15 · Jan 31, 2026 · Updated 5 months ago
Dahoas / QDSyntheticData
☆14 · Aug 15, 2024 · Updated last year
yale-nlp / ODSum
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆11 · Sep 20, 2024 · Updated last year
AIoT-MLSys-Lab / MMDeepResearch-Bench
MMDeepResearch-Bench (MMDR)
☆31 · Apr 1, 2026 · Updated 3 months ago
AI4fun / DQ-LoRe
☆13 · Jun 26, 2024 · Updated 2 years ago
RobustNLP / DeRTa
A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.
☆72 · May 22, 2025 · Updated last year
rookie-joe / FormalAlign
☆17 · Jul 12, 2025 · Updated last year
HKUNLP / ZeroGen
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆16 · Feb 18, 2022 · Updated 4 years ago
THU-KEG / Event-Level-Knowledge-Editing
☆12 · Apr 25, 2024 · Updated 2 years ago
THU-KEG / Xlore2.0
Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]
☆12 · Apr 5, 2017 · Updated 9 years ago
whyNLP / Conic10K
Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.
☆33 · Dec 6, 2023 · Updated 2 years ago
ajtejankar / mixtral-vis-moe
Visualize expert firing frequencies across sentences in the Mixtral MoE model
☆18 · Dec 22, 2023 · Updated 2 years ago
fish98 / CAShift
CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift (FSE 2025)
☆16 · Jun 25, 2026 · Updated 3 weeks ago
ganeshdg95 / Leveraging-Adversarial-Examples-to-Quantify-Membership-Information-Leakage
☆19 · Mar 6, 2023 · Updated 3 years ago
DependableSystemsLab / MIA_defense_HAMP
Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" …
☆13 · Sep 6, 2023 · Updated 2 years ago
amazon-science / llm-code-preference
Training and Benchmarking LLMs for Code Preference.
☆38 · Nov 15, 2024 · Updated last year
francescortu / comp-mech
Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024
☆13 · May 24, 2024 · Updated 2 years ago
LuoXiaoHeics / Continual-Tune
☆10 · Feb 6, 2025 · Updated last year
NJUDeepEngine / CAEF
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11 · Oct 11, 2024 · Updated last year
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆181 · Jul 4, 2026 · Updated 2 weeks ago
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆34 · Mar 7, 2025 · Updated last year
decoding-comp-trust / comp-trust
Codebase for decoding compressed trust.
☆27 · May 7, 2024 · Updated 2 years ago
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10 · Jan 7, 2020 · Updated 6 years ago
zhxieml / remiss-jailbreak
☆33 · Jun 24, 2024 · Updated 2 years ago
taishan1994 / pytorch_bert_coreference_resolution
Coreference resolution based on PyTorch and BERT
☆14 · Sep 16, 2021 · Updated 4 years ago
yangzhch6 / DARS
The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" (ICML 2026)
☆24 · Feb 4, 2026 · Updated 5 months ago
zweiein / End_to_end_Speech_Papers
☆13 · Sep 12, 2017 · Updated 8 years ago
llylly / RANUM
[ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.
☆13 · Jan 5, 2024 · Updated 2 years ago
carriex / lfqa_eval
ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"
☆21 · Mar 22, 2024 · Updated 2 years ago
tbh-98 / Hypergraph-MLP
☆20 · Jan 9, 2024 · Updated 2 years ago
Xiaoyu-SZ / LLMasEvaluator
Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)
☆21 · Aug 13, 2025 · Updated 11 months ago
csong27 / auditing-text-generation
Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)
☆10 · Jun 18, 2019 · Updated 7 years ago
swj0419 / detect-pretrain-code
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆243 · Nov 3, 2023 · Updated 2 years ago