google-deepmind / synthid-text
☆473Updated 2 weeks ago
Alternatives and similar repositories for synthid-text
Users that are interested in synthid-text are comparing it to the libraries listed below
Sorting:
- Open and efficient video watermarking☆380Updated last week
- Official implementation of the paper "Watermark Anything with Localized Messages"☆1,007Updated last month
- MarkLLM: An Open-Source Toolkit for LLM Watermarking.(EMNLP 2024 Demo)☆395Updated 2 months ago
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆423Updated 3 months ago
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆273Updated last year
- Official implementation of the paper "The Stable Signature Rooting Watermarks in Latent Diffusion Models"☆441Updated 4 months ago
- LiveBench: A Challenging, Contamination-Free LLM Benchmark☆732Updated this week
- LLM Self Defense: By Self Examination, LLMs know they are being tricked☆32Updated 11 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆703Updated last week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆517Updated 2 months ago
- ☆714Updated last week
- A collection of benchmarks and datasets for evaluating LLM.☆445Updated 10 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆295Updated this week
- ☆178Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆492Updated this week
- Seed-Coder is a family of open-source code LLMs comprising base, instruct and reasoning models of 8B size, developed by ByteDance Seed.☆183Updated last week
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]☆304Updated 3 months ago
- The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]☆243Updated 2 months ago
- DataComp for Language Models☆1,295Updated last month
- ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications☆202Updated last year
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.☆681Updated last month
- Simulation framework for accelerating research in Private Federated Learning☆327Updated this week
- Gemma 2 optimized for your local machine.☆369Updated 9 months ago
- Textbook on reinforcement learning from human feedback☆894Updated last week
- Red-Teaming Language Models with DSPy☆192Updated 3 months ago
- Test Software for the Characterization of AI Technologies☆248Updated last week
- Releases from OpenAI Preparedness☆736Updated this week
- ☆465Updated last month
- This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includ…☆19Updated 5 months ago
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal☆643Updated 9 months ago