Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
☆19Aug 22, 2024Updated last year
Alternatives and similar repositories for ferret
Users that are interested in ferret are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning☆10Oct 22, 2022Updated 3 years ago
- Test LLMs against jailbreaks and unprecedented harms☆40Oct 19, 2024Updated last year
- This repository contains the dataset and the pytorch implementations of the models from the paper CIDER: Commonsense Inference for Dialog…☆27Oct 30, 2022Updated 3 years ago
- ☆22Mar 16, 2023Updated 3 years ago
- ☆40Apr 15, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆76Mar 3, 2025Updated last year
- This repository contains the implementation of the paper -- KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks☆15Sep 15, 2022Updated 3 years ago
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆37Jul 12, 2024Updated last year
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆83May 17, 2025Updated last year
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Mar 12, 2023Updated 3 years ago
- Dictionary of English words tagged with their natural gender.☆13Sep 7, 2021Updated 4 years ago
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated last year
- A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022☆34Oct 9, 2023Updated 2 years ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆111Mar 8, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Wikipedia-based summarization dataset☆14Mar 27, 2023Updated 3 years ago
- ☆31Oct 14, 2021Updated 4 years ago
- ☆22Jul 26, 2025Updated 10 months ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆18Feb 6, 2023Updated 3 years ago
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆89Mar 15, 2024Updated 2 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆21Feb 26, 2025Updated last year
- This repository contains the dataset and baselines explained in the paper: M2H2: A Multimodal Multiparty Hindi Dataset For HumorRecogniti…☆19Mar 14, 2023Updated 3 years ago
- Code for the COLING 2022 paper "DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification"☆19Oct 19, 2022Updated 3 years ago
- ☆23Jul 29, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Lateral Inhibition-Inspired Convolutional Neural Network for Visual Attention and Saliency Detection☆13Nov 6, 2020Updated 5 years ago
- AIxCC: automated vulnerability repair via LLMs, search, and static analysis☆13Jul 16, 2024Updated last year
- ☆13Jan 14, 2025Updated last year
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆111Aug 7, 2024Updated last year
- ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents☆29Oct 28, 2024Updated last year
- ☆20Feb 20, 2025Updated last year
- ☆13Dec 22, 2023Updated 2 years ago
- Comparison of gradient estimation techniques for black-box adversarial examples☆11Oct 31, 2018Updated 7 years ago
- TaskWeaver Plugins☆12Jan 28, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of Boundary Attributions for Normal (Vector) Explanations☆11Aug 13, 2021Updated 4 years ago
- Generating and validating natural-language explanations for the brain.☆65Jun 1, 2026Updated last week
- Super gorgeous, easy-to-use and convenient mind map application☆15Nov 5, 2024Updated last year
- ☆17Mar 22, 2024Updated 2 years ago
- ☆16Jun 8, 2023Updated 3 years ago
- ☆60Mar 9, 2023Updated 3 years ago
- A simple random onion site scanner with onion list☆17Aug 7, 2018Updated 7 years ago