google-deepmind / synthid-textLinks
☆709Updated 4 months ago
Alternatives and similar repositories for synthid-text
Users that are interested in synthid-text are comparing it to the libraries listed below
Sorting:
- Open and efficient video and image watermarking☆533Updated last week
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆332Updated last year
- Official implementation of the paper "The Stable Signature Rooting Watermarks in Latent Diffusion Models"☆490Updated last week
- Official implementation of the paper "Watermark Anything with Localized Messages"☆1,091Updated 6 months ago
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.☆388Updated 2 months ago
- Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs☆100Updated last year
- open source interpretability platform 🧠☆590Updated this week
- LiveBench: A Challenging, Contamination-Free LLM Benchmark☆980Updated this week
- The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]☆320Updated last month
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆501Updated 10 months ago
- Humanity's Last Exam☆1,280Updated 2 months ago
- Code release for Best-of-N Jailbreaking☆550Updated 10 months ago
- Build datasets using natural language☆556Updated 3 months ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆778Updated 5 months ago
- ☆658Updated 3 months ago
- ☆235Updated last month
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal☆813Updated last year
- ☆2,503Updated last week
- Official inference library for pre-processing of Mistral models☆832Updated this week
- Prompt-to-Leaderboard☆269Updated 7 months ago
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆343Updated 2 months ago
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.☆737Updated 4 months ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]☆368Updated 11 months ago
- Gemma 2 optimized for your local machine.☆378Updated last year
- [NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.☆176Updated 8 months ago
- LettuceDetect is a hallucination detection framework for RAG applications.☆520Updated 3 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆384Updated 3 weeks ago
- ☆481Updated 5 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆123Updated 2 months ago
- Model Activity Visualiser☆519Updated 8 months ago