Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
☆19Aug 22, 2024Updated last year
Alternatives and similar repositories for ferret
Users that are interested in ferret are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Test LLMs against jailbreaks and unprecedented harms☆39Oct 19, 2024Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated 2 years ago
- Our EMNLP 2022 paper on MCQA☆23Jan 15, 2023Updated 3 years ago
- ☆22Mar 16, 2023Updated 3 years ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆76Mar 3, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆115Feb 26, 2025Updated last year
- This repository contains the implementation of the paper -- KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks☆15Sep 15, 2022Updated 3 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…☆12Feb 25, 2023Updated 3 years ago
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated last year
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆111Mar 8, 2024Updated 2 years ago
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- A Wikipedia-based summarization dataset☆14Mar 27, 2023Updated 3 years ago
- ☆29Oct 14, 2021Updated 4 years ago
- ☆21Jul 26, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆89Mar 15, 2024Updated 2 years ago
- Code and data accompanying paper "Twitter Homophily: Network Based Prediction of User’s Occupation"☆19Jul 23, 2020Updated 5 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆21Feb 26, 2025Updated last year
- This repository contains the dataset and baselines explained in the paper: M2H2: A Multimodal Multiparty Hindi Dataset For HumorRecogniti…☆19Mar 14, 2023Updated 3 years ago
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- Official implementation for Neural networks with recurrent generative feedback (NeurIPS 2020).☆22Nov 10, 2020Updated 5 years ago
- Code for the COLING 2022 paper "DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification"☆19Oct 19, 2022Updated 3 years ago
- ☆17Aug 15, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Jan 14, 2025Updated last year
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆111Aug 7, 2024Updated last year
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- Accompanying code for our EMNLP 2017 publication "Bringing Structure into Summaries: Crowdsourcing a Benchmark Corpus of Concept Maps"☆13Dec 5, 2017Updated 8 years ago
- ☆20Feb 20, 2025Updated last year
- Comparison of gradient estimation techniques for black-box adversarial examples☆11Oct 31, 2018Updated 7 years ago
- TaskWeaver Plugins☆12Jan 28, 2024Updated 2 years ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Apr 15, 2025Updated last year
- Generating and validating natural-language explanations for the brain.☆63Apr 24, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Super gorgeous, easy-to-use and convenient mind map application☆15Nov 5, 2024Updated last year
- ☆17Mar 22, 2024Updated 2 years ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆31Mar 9, 2025Updated last year
- ☆32May 22, 2025Updated 11 months ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆440Jan 22, 2025Updated last year
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆41Apr 7, 2026Updated 3 weeks ago