Dataset for the Tensor Trust project
☆47 · Mar 17, 2024 · Updated 2 years ago
Alternatives and similar repositories for tensor-trust-data
Users who are interested in tensor-trust-data are comparing it to the libraries listed below.
- ☆15 · Jun 7, 2024 · Updated last year
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries ☆72 · Nov 10, 2025 · Updated 6 months ago
- Tasks for describing differences between text distributions. ☆17 · Aug 9, 2024 · Updated last year
- ☆23 · May 20, 2025 · Updated last year
- Official Repository for Can Language Models be Instructed to Protect Personal Information? ☆13 · Oct 8, 2023 · Updated 2 years ago
- ACL 2023 (Findings) End-to-end Cross-lingual Label Project ☆15 · Nov 24, 2023 · Updated 2 years ago
- Pronto is an automation suite for deploying and managing DataStax Cassandra clusters in AWS. ☆14 · Jul 1, 2020 · Updated 5 years ago
- Augmenting Statistical Models with Natural Language Parameters ☆28 · Sep 17, 2024 · Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆251 · Feb 24, 2025 · Updated last year
- A tool for visualization of complex job searches. ☆13 · Jul 8, 2022 · Updated 3 years ago
- robust polynomial multiplication in modulo m ☆19 · Apr 30, 2016 · Updated 10 years ago
- ASGI Middleware for serving static file. ☆15 · Dec 18, 2025 · Updated 5 months ago
- A text-based game where language models learn to lie and to detect lies. ☆12 · Oct 4, 2023 · Updated 2 years ago
- ☆31 · Jul 14, 2023 · Updated 2 years ago
- Evaluate Transformers from the Hub 🔥 ☆14 · Apr 3, 2026 · Updated last month
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback" ☆12 · Dec 6, 2024 · Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models ☆56 · Aug 17, 2024 · Updated last year
- Recognize faces/objects in a video stream (from a webcam or a security camera) and send notifications to your devices ☆12 · May 12, 2024 · Updated 2 years ago
- ☆12 · Jun 13, 2025 · Updated 11 months ago
- ☆139 · Jul 7, 2025 · Updated 10 months ago
- Code for "A Principled Framework for Multi-View Contrastive Learning" ☆20 · Jul 10, 2025 · Updated 10 months ago
- ☆19 · Aug 10, 2024 · Updated last year
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews ☆17 · Dec 14, 2025 · Updated 5 months ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding ☆152 · Jul 19, 2024 · Updated last year
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. ☆117 · Jun 13, 2024 · Updated last year
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment ☆111 · Mar 8, 2024 · Updated 2 years ago
- Public repo for ETH Escape CTF @ Devcon 2024: https://devcon.org/ ☆13 · Dec 11, 2024 · Updated last year
- TAP: An automated jailbreaking method for black-box LLMs ☆231 · Dec 10, 2024 · Updated last year
- ☆17 · Mar 20, 2025 · Updated last year
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and… ☆27 · Dec 26, 2025 · Updated 4 months ago
- ☆18 · Apr 7, 2025 · Updated last year
- Code for a research paper "Part-Based Models Improve Adversarial Robustness" (ICLR 2023) ☆20 · Sep 16, 2023 · Updated 2 years ago
- ☆16 · Mar 22, 2025 · Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆15 · Jun 21, 2024 · Updated last year
- ☆11 · Jan 19, 2025 · Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23] ☆11 · Aug 31, 2023 · Updated 2 years ago
- ☆732 · Jul 2, 2025 · Updated 10 months ago
- ☆20 · Mar 20, 2025 · Updated last year
- QL-Relax ☆13 · Aug 12, 2025 · Updated 9 months ago