google-deepmind / synthid-textLinks
☆763Updated 5 months ago
Alternatives and similar repositories for synthid-text
Users that are interested in synthid-text are comparing it to the libraries listed below
Sorting:
- Open and efficient video and image watermarking☆564Updated this week
- Official implementation of the paper "The Stable Signature Rooting Watermarks in Latent Diffusion Models"☆494Updated last month
- open source interpretability platform 🧠☆675Updated this week
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆343Updated last year
- Humanity's Last Exam☆1,323Updated 3 months ago
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.☆411Updated 3 months ago
- ☆2,568Updated this week
- LiveBench: A Challenging, Contamination-Free LLM Benchmark☆1,029Updated this week
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆350Updated 3 months ago
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆518Updated 11 months ago
- Build datasets using natural language☆559Updated 4 months ago
- The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]☆330Updated 2 months ago
- Gemma 2 optimized for your local machine.☆378Updated last year
- Dream 7B, a large diffusion language model☆1,157Updated 2 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆496Updated 5 months ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆788Updated 6 months ago
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.☆742Updated 5 months ago
- ☆659Updated 4 months ago
- Moonshot - A simple and modular tool to evaluate and red-team any LLM application.☆306Updated last week
- ☆237Updated 2 months ago
- Official inference library for pre-processing of Mistral models☆849Updated this week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,244Updated this week
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal☆841Updated last year
- An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.☆907Updated 4 months ago
- OpenAI Frontier Evals☆990Updated last month
- Prompt-to-Leaderboard☆271Updated 8 months ago
- JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]☆519Updated 9 months ago
- ☆257Updated 3 weeks ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]☆377Updated last year
- Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs☆105Updated last year