TrustGen/TrustEval-toolkit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TrustGen/TrustEval-toolkit)

TrustGen / TrustEval-toolkit

[ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models.

☆132

Alternatives and similar repositories for TrustEval-toolkit

Users that are interested in TrustEval-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wyf23187 / Adaptive_Distractions
View on GitHub
NeurIPS 2025 Poster
☆26Feb 4, 2025Updated last year
HowieHwong / DataGen
View on GitHub
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
☆69Mar 8, 2025Updated last year
HowieHwong / sde-harness
View on GitHub
SDE-Harness (Scientific Discovery Evaluation Framework)
☆59Mar 27, 2026Updated 3 months ago
HowieHwong / TrustLLM
View on GitHub
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
☆629Jun 24, 2025Updated last year
HowieHwong / Agentic-Guardian
View on GitHub
[ICLR'26] Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
☆48Oct 26, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
limenlp / ExeVRM
View on GitHub
Official implementation for the paper "Video-Based Reward Modeling for Computer-Use Agents"
☆17Mar 14, 2026Updated 4 months ago
LinxinS97 / NLPBench
View on GitHub
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10Oct 27, 2023Updated 2 years ago
Flossiee / HonestyLLM
View on GitHub
[NeurIPS 2024] HonestLLM: Toward an Honest and Helpful Large Language Model
☆29Jun 10, 2025Updated last year
hwang219 / AttackFakeNews
View on GitHub
☆27Feb 28, 2023Updated 3 years ago
limenlp / SEA
View on GitHub
Official Implementation for the paper "Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base"
☆27Sep 2, 2025Updated 10 months ago
Jinxiaolong1129 / Foot-in-the-door-Jailbreak
View on GitHub
☆23May 14, 2025Updated last year
zxiangx / LC-R1
View on GitHub
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆29Oct 20, 2025Updated 9 months ago
wuxiyang1996 / AutoHallusion
View on GitHub
AutoHallusion Codebase (EMNLP 2024)
☆23Dec 6, 2024Updated last year
Dongping-Chen / Clawatar
View on GitHub
From Agentic Intelligence to Interactive Intelligence. Give your AI agent a body and home.
☆19Feb 22, 2026Updated 5 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
OSU-NLP-Group / SeeActChromeExtension
View on GitHub
☆18Jan 3, 2025Updated last year
JackHck / MADAug
View on GitHub
[ICCV 2023] MADAug: When to Learn What: Model-Adaptive Data Augmentation Curriculum
☆20Nov 9, 2023Updated 2 years ago
wyf23187 / DyFlow
View on GitHub
NeurIPS 2025 Poster
☆22Oct 17, 2025Updated 9 months ago
Dongping-Chen / ISG
View on GitHub
(ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.
☆31Aug 7, 2025Updated 11 months ago
ShiJiawenwen / JudgeDeceiver
View on GitHub
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
☆41Sep 17, 2025Updated 10 months ago
KehanGuo2 / MolPuzzle
View on GitHub
[NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation
☆20Jan 2, 2026Updated 6 months ago
Nathangitlab / Backdoor-Attacks-on-Crowd-Counting
View on GitHub
this is for the ACM MM paper---Backdoor Attack on Crowd Counting
☆17Jul 10, 2022Updated 4 years ago
MINE-Lab-ND / SpectrumML_Survey_Papers
View on GitHub
☆36Feb 18, 2025Updated last year
JieyuZ2 / TaskMeAnything
View on GitHub
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
☆71Nov 27, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Episoode / Double-Bench
View on GitHub
[AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?
☆31Dec 14, 2025Updated 7 months ago
SaFo-Lab / DoxBench
View on GitHub
[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"
☆30Feb 7, 2026Updated 5 months ago
baixuechunzi / llm-implicit-bias
View on GitHub
☆23Mar 11, 2025Updated last year
HowieHwong / MetaTool
View on GitHub
[ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
☆115Mar 21, 2024Updated 2 years ago
weiyezhimeng / SQL-Injection-Jailbreak
View on GitHub
☆22Jul 26, 2025Updated last year
SalesforceAIResearch / CoAct-1
View on GitHub
CoAct-1: Computer-using Agents with Coding as Actions
☆27Jun 2, 2026Updated last month
limenlp / verl
View on GitHub
AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
☆56Jun 13, 2025Updated last year
xirui-li / DrAttack
View on GitHub
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
☆68Aug 25, 2024Updated last year
PERSONA-bench / PERSONA
View on GitHub
LLM Benchmark
☆45May 24, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AI45Lab / X-Boundary
View on GitHub
[EMNLP 2025] The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Com…
☆41Nov 24, 2025Updated 8 months ago
OSU-NLP-Group / AgentSafety
View on GitHub
☆192Oct 31, 2025Updated 8 months ago
JackHck / SBCL
View on GitHub
[ICCV 2023] Subclass-balancing contrastive learning for long-tailed recognition
☆18Oct 30, 2023Updated 2 years ago
niklasrisse / LimitsOfML4Vuln
View on GitHub
☆26Feb 6, 2024Updated 2 years ago
Dystopians / SecDOOD
View on GitHub
[ ICCV'2025 Poster ] SecDOOD is a secure cloud-device collaboration framework for efficient on-device OOD detection without requiring de…
☆15Oct 10, 2025Updated 9 months ago
SciMT / SciMT-benchmark
View on GitHub
☆11Jan 3, 2024Updated 2 years ago
Dongping-Chen / MixSet
View on GitHub
(NAACL 2024) Official code repository for Mixset.
☆27Dec 4, 2024Updated last year