i-gao/model-equality-testing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/i-gao/model-equality-testing)

i-gao / model-equality-testing

Test equality between a black-box LLM API and a reference distribution

☆18

Alternatives and similar repositories for model-equality-testing

Users that are interested in model-equality-testing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EECS150 / fpga_labs_fa21
View on GitHub
FPGA Labs for EECS 151/251A (Fall 2021)
☆12Oct 20, 2021Updated 4 years ago
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
facebookresearch / dualformer
View on GitHub
implementation of dualformer
☆25Mar 1, 2025Updated last year
parameterlab / trap
View on GitHub
Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)
☆14Nov 20, 2024Updated last year
WhitolfChen / REFINE
View on GitHub
[ICLR 2025] REFINE: Inversion-Free Backdoor Defense via Model Reprogramming
☆13Feb 13, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Xuzhenhua55 / awesome-llm-copyright-protection
View on GitHub
A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…
☆51Jun 10, 2026Updated 3 weeks ago
dkout / 18.065
View on GitHub
Matrix Methods In Data Analysis, Signal Processing, And Machine Learning
☆10Sep 2, 2018Updated 7 years ago
Fangjun-Li / SpatialLM-StepGame
View on GitHub
Codes and data for AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the …
☆14Apr 23, 2024Updated 2 years ago
flock-org / flock-artifact
View on GitHub
A framework for deploying on-demand distributed-trust.
☆14Jun 4, 2024Updated 2 years ago
johnmyleswhite / StatsFunctionsNotes
View on GitHub
Jupyter notebooks showing to implement statistical functions.
☆14Jun 14, 2020Updated 6 years ago
IT3DEgo / IT3DEgo
View on GitHub
CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"
☆19Jun 27, 2024Updated 2 years ago
DavidFanzz / llm_decoding
View on GitHub
☆12Apr 25, 2025Updated last year
indexofknowledge / iok
View on GitHub
Index of Knowledge
☆16Jan 6, 2023Updated 3 years ago
Trust4AI / ASTRAL
View on GitHub
Automated Safety Testing of Large Language Models
☆18Jan 31, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ScalingIntelligence / large_language_monkeys
View on GitHub
☆116Sep 25, 2024Updated last year
sail-sg / P-DoS
View on GitHub
[ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models
☆23Oct 22, 2024Updated last year
MadryLab / bias-transfer
View on GitHub
☆15Jul 24, 2022Updated 3 years ago
aypan17 / latentqa
View on GitHub
☆34Nov 16, 2025Updated 7 months ago
kkkevinkkkkk / situated_faithfulness
View on GitHub
☆14Oct 17, 2024Updated last year
telepath-computer / stash
View on GitHub
Keep any folder in sync across computers, conflict-free.
☆62May 18, 2026Updated last month
LUMIA-Group / HuRef
View on GitHub
Official implementation for "HuRef: HUman-REadable Fingerprint for Large Language Models" (NeurIPS2024)
☆16Jun 17, 2025Updated last year
AISG-Technology-Team / GCSS-Track-1A-Submission-Guide
View on GitHub
Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).
☆16Jul 4, 2024Updated last year
shinington / Robust-PDF-Classifier-with-Conserved-Features
View on GitHub
Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…
☆11Aug 5, 2019Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
shauli-ravfogel / adv-kernel-removal
View on GitHub
☆12Oct 23, 2022Updated 3 years ago
flukeskywalker / nanoDD
View on GitHub
Simple Scalable Discrete Diffusion for text in PyTorch
☆37Sep 27, 2024Updated last year
CUHK-Shenzhen-SE / D4C
View on GitHub
[ICSE'25] Aligning the Objective of LLM-based Program Repair
☆24Mar 8, 2025Updated last year
tom-pollak / claudette-pydantic
View on GitHub
☆10Oct 22, 2024Updated last year
yuleiqin / RAIF
View on GitHub
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆32Oct 9, 2025Updated 8 months ago
shinington / facesec
View on GitHub
Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021
☆13Jun 22, 2021Updated 5 years ago
inspire-group / tta_risk
View on GitHub
☆15Jun 6, 2023Updated 3 years ago
p2c2e / mcp_proxy_pydantic_agent
View on GitHub
Example for exposing MCP servers to Pydantic Agents
☆18Mar 16, 2025Updated last year
T0hsakar1n / RAPID
View on GitHub
Source code and scripts for the paper "Is Difficulty Calibration All We Need? Towards More Practical Membership Inference Attacks"
☆20Dec 10, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
selfdefend / Code
View on GitHub
☆34Jan 26, 2025Updated last year
twj-KAIST / OOD-MAML
View on GitHub
☆13Oct 14, 2020Updated 5 years ago
yoheikikuta / adversarial-camera-stickers
View on GitHub
A very limited implementation of arXiv:1904.00759
☆13Dec 2, 2019Updated 6 years ago
sharontlin / undergrad-summer-opportunities
View on GitHub
Programs/fellowships for undergrads
☆11Mar 25, 2020Updated 6 years ago
MLforHealth / MIMIC_Generalisation
View on GitHub
Code to study the generalisability of benchmark models on non-stationary EHRs.
☆15Aug 7, 2019Updated 6 years ago
shacharKZ / Visualizing-the-Information-Flow-of-GPT
View on GitHub
☆10Oct 14, 2023Updated 2 years ago
lchen001 / HAPI
View on GitHub
☆16Nov 30, 2022Updated 3 years ago