☆23May 20, 2025Updated last year
Alternatives and similar repositories for libra-eval
Users that are interested in libra-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆38Sep 29, 2025Updated 7 months ago
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation☆14Oct 7, 2023Updated 2 years ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- ☆26Aug 8, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 3 years ago
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs☆12Nov 7, 2024Updated last year
- ☆22Dec 18, 2024Updated last year
- Repository for the paper Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions☆16Jun 7, 2024Updated last year
- Papers about red teaming LLMs and Multimodal models.☆165May 28, 2025Updated 11 months ago
- Links to publications that focus on the interpretation and analysis of in-context learning☆15Oct 17, 2024Updated last year
- Envoy Wasm filter for traffic tracing used in APIClarity.☆13Jun 19, 2024Updated last year
- Advanced Machine Learning Fall 2020 Project Repository☆12Dec 12, 2020Updated 5 years ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Oct 23, 2022Updated 3 years ago
- A Burp Suite extension that converts IP addresses to decimal notation, useful for SSRF bypass and WAF evasion testing. Created by Harshad…☆12Dec 9, 2024Updated last year
- Dataset for the Tensor Trust project☆47Mar 17, 2024Updated 2 years ago
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- LLM手撕代码合集☆21Mar 25, 2025Updated last year
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and…☆27Dec 26, 2025Updated 5 months ago
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning”☆20Aug 20, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation☆38Oct 18, 2023Updated 2 years ago
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated last year
- ☆15Feb 26, 2025Updated last year
- Complete set of English dialect transformation rules and evaluation code☆17Jun 7, 2024Updated last year
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆31Sep 29, 2025Updated 7 months ago
- ☆11Jun 11, 2025Updated 11 months ago
- UCAS大三自然语言处理课程大作业☆12Jun 25, 2023Updated 2 years ago
- ☆12Apr 18, 2019Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆30Apr 2, 2025Updated last year
- ☆15Mar 6, 2026Updated 2 months ago
- Data augmentation using OpenCV☆11Jan 12, 2017Updated 9 years ago
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Jan 22, 2025Updated last year
- 供大学生,竞赛生,高中生查找的math-wiki☆10May 26, 2022Updated 4 years ago
- ☆53Mar 31, 2026Updated last month
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago