☆30Oct 21, 2025Updated 4 months ago
Alternatives and similar repositories for selfplay-redteaming
Users that are interested in selfplay-redteaming are comparing it to the libraries listed below
Sorting:
- WebPHPack is a simple php alternative to webpack for auto combining multiple JS and CSS files into single files.☆10Feb 16, 2018Updated 8 years ago
- ☆12Jul 8, 2024Updated last year
- ☆16Nov 8, 2024Updated last year
- Prompt + regex lab☆10Nov 22, 2023Updated 2 years ago
- NodeJS API Wrapper for WhatsApp☆11Apr 9, 2022Updated 3 years ago
- CR-LT KGQA Dataset Repository☆11Jun 1, 2025Updated 8 months ago
- ☆10May 27, 2024Updated last year
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- A COVID-19 Virus Stats Tracking and Notification Platform☆12Dec 11, 2022Updated 3 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆20Jul 31, 2025Updated 7 months ago
- ☆19Updated this week
- ☆15Jun 18, 2024Updated last year
- 基于vue的pdf预览组件☆13Jul 20, 2022Updated 3 years ago
- Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks☆40Feb 11, 2026Updated 2 weeks ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- ☆15May 18, 2025Updated 9 months ago
- Make open-weight LLM agents play the game "Among Us", and study how the models learn and express lying and deception in the game.☆24Dec 17, 2025Updated 2 months ago
- Scripts for drawing figures in your paper☆10Jan 8, 2025Updated last year
- FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…☆13Apr 25, 2024Updated last year
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆18Oct 18, 2025Updated 4 months ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks☆19Sep 18, 2025Updated 5 months ago
- ☆12May 27, 2022Updated 3 years ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…☆11Aug 24, 2022Updated 3 years ago
- ☆11Jul 31, 2022Updated 3 years ago
- 谷歌的人工智能库TensorFlow的PHP扩展,使用SWIG进行工作☆11Feb 7, 2017Updated 9 years ago
- 山东师范大学本科生学位论文Latex模板(2021版)☆13May 16, 2022Updated 3 years ago
- The first toolkit for MLRM safety evaluation, providing unified interface for mainstream models, datasets, and jailbreaking methods!☆14Apr 8, 2025Updated 10 months ago
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆20Feb 17, 2026Updated last week
- Self Evolving Large Multimodal Models with Continuous Rewards☆19Nov 21, 2025Updated 3 months ago
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"☆18Jul 13, 2025Updated 7 months ago
- 聊天室demo☆10Mar 21, 2017Updated 8 years ago
- Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing☆14Feb 18, 2021Updated 5 years ago
- My Algorithmic trading bots and strategies for Quantitative and High-Frequency trading in FinTech☆14Apr 3, 2024Updated last year
- A sample file and folder structure for a Node.JS project.☆12Aug 20, 2017Updated 8 years ago
- Compositional Abstractions Tutorial☆13Nov 26, 2023Updated 2 years ago
- ☆11Feb 21, 2022Updated 4 years ago
- Web server to receive uploaded LaTeX and execute it in a docker container.☆15Feb 3, 2026Updated 3 weeks ago
- Electron.js application tested with Cypress - WIP☆12Sep 19, 2019Updated 6 years ago
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆26Feb 5, 2026Updated 3 weeks ago