ziyuwowo/trust-eval-mm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ziyuwowo/trust-eval-mm)

ziyuwowo / trust-eval-mm

Multi-dimensional trustworthiness evaluation for multimodal LLMs

☆127

Alternatives and similar repositories for trust-eval-mm

Users that are interested in trust-eval-mm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ziyuwowo / mllm-jailbreak-bench
View on GitHub
Reproducible benchmark for adversarial attacks on multimodal large language models
☆220May 24, 2026Updated last month
zblcving9-gif / xianxia-AI-town-AI-
View on GitHub
大模型驱动的npc决策，npc和玩家平权
☆996Mar 29, 2026Updated 3 months ago
tophant-ai / aibeat
View on GitHub
Break your AI before they do.
☆998Jun 18, 2026Updated last month
Dong90 / oh-my-taiyiforge
View on GitHub
AI workflow automation plugin for intelligent code generation with Claude/Codex
☆1,017Updated this week
kepengxu / PRISM-VL
View on GitHub
PRISM-VL studies measurement-grounded VLM learning with RAW-derived Meas.-XYZ inputs, camera-conditioned grounding, and exposure-brackete…
☆1,024May 27, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ThinkWatchProject / ThinkWatch
View on GitHub
Enterprise AI bastion host for secure AI API and MCP access, with unified proxying, RBAC, audit logs, rate limiting, and cost tracking ac…
☆951May 27, 2026Updated last month
ziyuwowo / safegate
View on GitHub
Lightweight runtime safety guard for multimodal LLM I/O
☆131May 24, 2026Updated last month
nolanx-ai / nolanx.ai
View on GitHub
Nolanx, Open-sourced AI Netflix.
☆1,491May 26, 2026Updated last month
MatrixAges / polywise
View on GitHub
The open source agentic content system to make your contents alive. Self-hosted on any platform. ◑
☆756Jun 11, 2026Updated last month
im4codes / imcodes
View on GitHub
The IM for agents. Shared Agent Context & Memory, supervised execution, and cross-agent audit across AI providers.
☆1,049Updated this week
zengxiao-he / tessera
View on GitHub
From teacher to tiles — a from-scratch LLM distillation & serving engine: custom Triton/CUDA kernels, FSDP distillation, paged-KV continu…
☆588Jun 5, 2026Updated last month
devmikets / hyperliquid-sdk
View on GitHub
hyperliquid sdk | hyperliquid sdk | hyperliquid sdk | hyperliquid sdk | hyperliquid sdk | hyperliquid sdk | hyperliquid sdk | hyperliquid…
☆561May 15, 2026Updated 2 months ago
machinepulse-ai / world2agent
View on GitHub
World2Agent(W2A) is an open protocol that standardizes how Al agents perceive the real world.
☆1,244May 9, 2026Updated 2 months ago
flatkey-ai / awesome-images
View on GitHub
Try flatkey.ai for 40% saving! Generate practical images ready for all your work needs!!!
☆675Jun 19, 2026Updated last month
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
fuyuxiang / echo-agent
View on GitHub
Echo Agent 是一个可自托管、长期运行、持续学习的 AI Agent，面向个人与团队的私有自动化场景。它可以部署在自有服务器上，统一连接模型、工具、记忆、权限与消息入口。内置四层认知记忆、遗忘曲线与矛盾检测机制，能够在跨会话任务中持续沉淀上下文，并保持长期记忆的质量…
☆720Updated this week
lingyuanli / MultiGen
View on GitHub
Multi-agent end-to-end application - General-purpose artificial intelligence agent for multimodal agent collaboration
☆467Jun 6, 2026Updated last month
dev-polymarket / clob-client-v2
View on GitHub
polymarket clob | polymarket clob | polymarket clob | polymarket clob | polymarket clob | polymarket clob | polymarket clob | polymarket …
☆563Jun 6, 2026Updated last month
liweicong2016-collab / my-website
View on GitHub
☆482Jun 18, 2026Updated last month
worldliberty / agentpay-sdk
View on GitHub
An open SDK for agentic payments. Let AI agents make payments, hold funds, and move money across chains with policy enforcement and human…
☆516Jun 8, 2026Updated last month
chenyuan200356-droid / crypto-news-bot
View on GitHub
☆485Apr 5, 2026Updated 3 months ago
Q-Future / Q-ReAlign
View on GitHub
Q-Align-style framework with lightweight LMMs, including 0.8B, 4B, and 9B checkpoints, dataset builders, caching, scoring, and CLI workfl…
☆463Jun 24, 2026Updated 3 weeks ago
linxidnju / OpenTag
View on GitHub
Open-source, channel-native agent gateway for Slack. Route team threads to Claude Code, Codex, OpenCode, Docker, HTTP agents, and custom …
☆544Jul 10, 2026Updated last week
MetapriseAI / OrgKernel
View on GitHub
Open-source trust layer for AI agents — cryptographic agent identity (Ed25519), instance-scoped execution tokens, SHA-256 hash-chained au…
☆2,120Jul 6, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aaryansamanta / ai-research-publications
View on GitHub
Collection of my peer-reviewed high-school research: IEEE AIAM 2025 (quantum-inspired GA + GNN for multimodal classification) + IJHSR/UCS…
☆810Feb 19, 2026Updated 5 months ago
GizClaw / flowcraft
View on GitHub
Production-grade Go SDK for building AI agents with long-term memory, knowledge retrieval, and voice — runnable as a library, a daemon, o…
☆486Updated this week
xiaoshideta / Streaming-dLLM
View on GitHub
Diffusion Language Model
☆420Jun 24, 2026Updated 3 weeks ago
OpenRaiser / NanoResearch
View on GitHub
🦞+🔬 NanoResearch: The Autonomous AI Research Assistant
☆1,482May 26, 2026Updated last month
cPilot-GUI / Amis
View on GitHub
🖥️ A local-first desktop workspace for private agents, local models, runtime visibility, and local/cloud routing.
☆566Jun 24, 2026Updated 3 weeks ago
Octoday-Hub / Embodied-AI
View on GitHub
星期八 Octoday 「具身智能知识索引与产业地图」
☆2,121Jun 26, 2026Updated 3 weeks ago
HangYu8123 / HarnessFlow
View on GitHub
Harness coding workflow for codex, claude, github copilot
☆523Updated this week
OpenNSWM-Lab / FAROS
View on GitHub
A blueprint-driven AutoResearch runtime for orchestrating AI research workflows from idea generation and experiments to paper writing and…
☆2,109Updated this week
ascending-llc / jarvis-registry
View on GitHub
Connect any AI copilot or autonomous agent to your enterprise tools — through a single, secure MCP/Agent gateway with built-in identity, …
☆2,366Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yourdadisco / hotspot_analysis
View on GitHub
AI热点解析助手，用于分析AI前沿信息是否对业务有帮助，紧跟AI技术迭代的同时也能忽略一些无关信息。
☆492Jun 29, 2026Updated 3 weeks ago
PanqiYang1 / MUSE
View on GitHub
ICML2026: Resolving Manifold Misalignment in Visual Tokenization via Topological Orthogonality
☆379Jun 1, 2026Updated last month
Tong89 / smartNode
View on GitHub
☆1,923Jun 2, 2026Updated last month
solo-xin / SOLO-Builders
View on GitHub
A curated resource collection for independent developers, AI builders, and one-person companies.
☆473May 7, 2026Updated 2 months ago
AudarAI / Audar-ASR-V1
View on GitHub
Arabic-first generative speech recognition — Audar-ASR-V1 (Flash + Turbo). #1 on the Open Universal Arabic ASR Leaderboard. Model cards, …
☆556Updated this week
aaryansamanta / usaco-tripple-perfect
View on GitHub
Showcasing my 2025 USACO US Open dual perfect scores (1000/1000 in both Gold & Silver divisions) — one of only 8 U.S. high schoolers nati…
☆813Feb 22, 2026Updated 5 months ago
RickyTong1 / audit-harness
View on GitHub
Three-layer audit enforcement framework for AI agents — hooks, skills, context recovery, and audit-driven daily reports
☆577Jun 10, 2026Updated last month