Dataset for the Tensor Trust project
☆49Mar 17, 2024Updated 2 years ago
Alternatives and similar repositories for tensor-trust-data
Users that are interested in tensor-trust-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Jun 7, 2024Updated 2 years ago
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆76Nov 10, 2025Updated 7 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆640Jun 2, 2026Updated last month
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- ☆23May 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- ACL 2023 (Findings) End-to-end Cross-lingual Label Project☆15Nov 24, 2023Updated 2 years ago
- Pronto is an automation suite for deploying and managing DataStax Cassandra clusters in AWS.☆14Jul 1, 2020Updated 6 years ago
- Augmenting Statistical Models with Natural Language Parameters☆28Sep 17, 2024Updated last year
- RuLES: a benchmark for evaluating rule-following in language models☆255Feb 24, 2025Updated last year
- Large Language Model for Blockchain☆58May 24, 2023Updated 3 years ago
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).☆19Jun 21, 2024Updated 2 years ago
- ☆31Jul 14, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- A toolkit to automatically crawl the paper list and download paper pdfs of ACL Ahthology.☆11Nov 12, 2025Updated 7 months ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆56Aug 17, 2024Updated last year
- Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).☆68Mar 23, 2026Updated 3 months ago
- Adversarial detection and defense for deep learning systems using robust feature alignment☆17Nov 10, 2020Updated 5 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆446Jan 22, 2025Updated last year
- ☆143Jul 7, 2025Updated 11 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 8 months ago
- Code for "A Principled Framework for Multi-View Contrastive Learning"☆20Jul 10, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The respository describing a novel datasets for word association explanations☆13Sep 21, 2023Updated 2 years ago
- ☆19Aug 10, 2024Updated last year
- Improving Alignment and Robustness with Circuit Breakers☆264Sep 24, 2024Updated last year
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆18Dec 14, 2025Updated 6 months ago
- An isolated environment for DNS cache poisoning attack investigation and demonstration.☆10Nov 22, 2020Updated 5 years ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆118Jun 13, 2024Updated 2 years ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆111Mar 8, 2024Updated 2 years ago
- Public repo for ETH Escape CTF @ Devcon 2024: https://devcon.org/☆13Dec 11, 2024Updated last year
- ☆13Nov 8, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Mar 20, 2025Updated last year
- TAP: An automated jailbreaking method for black-box LLMs☆238Dec 10, 2024Updated last year
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and…☆28Dec 26, 2025Updated 6 months ago
- ☆19Apr 7, 2025Updated last year
- Code for a research paper "Part-Based Models Improve Adversarial Robustness" (ICLR 2023)☆20Sep 16, 2023Updated 2 years ago
- ☆16Mar 22, 2025Updated last year
- ☆11Jan 19, 2025Updated last year