☆11Nov 12, 2024Updated last year
Alternatives and similar repositories for CoSafe-Dataset
Users that are interested in CoSafe-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated 11 months ago
- ☆16Sep 27, 2023Updated 2 years ago
- Red Queen Dataset and data generation template☆26Dec 26, 2025Updated 3 months ago
- These are my jupyter notebooks on ML & DL.☆13Mar 28, 2019Updated 7 years ago
- ☆14Dec 3, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A very simple tool to help you make posters at home☆11Jul 28, 2023Updated 2 years ago
- Arabic speech recognition and dialect identification (Red Hen Lab - GSoC 2018)☆17Sep 1, 2020Updated 5 years ago
- Arabic Dialect Identification on AOC data.☆24Mar 2, 2019Updated 7 years ago
- kaggle - RSNA STR Pulmonary Embolism Detection☆10Nov 22, 2020Updated 5 years ago
- ☆18Jan 3, 2025Updated last year
- Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM☆39Jan 17, 2025Updated last year
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"☆40Jul 8, 2024Updated last year
- ☆25Nov 4, 2024Updated last year
- ☆129Dec 3, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Implementation of implicit reference attack☆11Oct 16, 2024Updated last year
- Sentiment analysis of song lyrics compared to auditory track features and valence☆13Feb 19, 2023Updated 3 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- A minimalist quadcopter model and hover simulation built in MATLAB/Octave, based on Francesco Sabatino’s master thesis at KTH. This proje…☆13Aug 13, 2025Updated 8 months ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- ☆20May 14, 2025Updated 11 months ago
- ☆12Oct 29, 2023Updated 2 years ago
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆48May 12, 2025Updated 11 months ago
- Be notified of recent events in the news by setting up alerts. Program uses NLP techniques such as keyword matching, k-clustering and sem…☆11Jun 27, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NAACL 2025] SIUO: Cross-Modality Safety Alignment☆124Jan 31, 2025Updated last year
- 🌟 手把手教你在论文中插入代码链接☆24Aug 2, 2025Updated 8 months ago
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- A curated list of 150+ papers and resources on Agentic Security. Based on the survey covering the transition from passive LLMs to autonom…☆38Mar 31, 2026Updated 2 weeks ago
- Service for sending notifications.☆13Aug 28, 2022Updated 3 years ago
- Speaker verification task with ECAPA-TDNN model (trained on Persian dataset)☆12Sep 15, 2022Updated 3 years ago
- This repository contains PyTorch implementations of deep reinforcement learning algorithms and environments for Robotics and Controls. T…☆19Mar 20, 2022Updated 4 years ago
- Python package for processing deep learning models☆15Dec 12, 2025Updated 4 months ago
- This is the codebase for defense framework described in USENIX '21 paper "WaveGuard: Understanding and Mitigating Audio Adversarial Examp…☆21Oct 20, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 针对Cnews数据集进行分类,使用了torchtext进行文本预处理☆11Sep 16, 2022Updated 3 years ago
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆25Jan 29, 2026Updated 2 months ago
- 中文无监督文本聚类☆14Mar 3, 2022Updated 4 years ago
- ☆39May 17, 2025Updated 10 months ago
- An end-to-end chorus detection model DeepChorus.☆37Mar 27, 2022Updated 4 years ago
- [NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the…☆91Feb 3, 2026Updated 2 months ago
- Knowledge Graph Embeddings including TransE, TransH, TransR and PTransE☆14May 3, 2018Updated 7 years ago