Working Memory Attack on LLMs
☆18May 27, 2025Updated last year
Alternatives and similar repositories for working-memory-attack-on-llms
Users that are interested in working-memory-attack-on-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated 11 months ago
- ☆25Jan 17, 2025Updated last year
- Adversarial Example Attacks on Policy Learners☆40Jul 23, 2020Updated 5 years ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆27Jul 6, 2024Updated last year
- ☆10Mar 22, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- annotated dataset of cyber-security related tweets☆22May 10, 2021Updated 5 years ago
- ☆13Sep 8, 2024Updated last year
- ☆26Aug 21, 2024Updated last year
- RevLLM -- Reverse Engineering Tools for Large Language Models☆22Feb 29, 2024Updated 2 years ago
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- ☆15Jul 8, 2023Updated 2 years ago
- ☆14May 22, 2017Updated 9 years ago
- Official TensorFlow implementation of "Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization" (ICML 2019)☆42Dec 7, 2020Updated 5 years ago
- Implementation of our ICLR 2021 paper: Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples.☆11Mar 9, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and datasets for the salesforce AI research paper on prompt leakage and multi-turn threats against LLMs☆22Jun 2, 2026Updated last week
- ☆41Feb 24, 2025Updated last year
- Training of agents using Reinforcement and Imitation Learning to simulate human crowds behavior, using Unity and ML-Agents Toolkit.☆13Nov 29, 2021Updated 4 years ago
- Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"☆46Oct 12, 2022Updated 3 years ago
- ☆18Nov 8, 2024Updated last year
- A python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R.S. Sutton and A. G. Barto.☆21Jul 13, 2020Updated 5 years ago
- Our Sentimental LIAR dataset is a modified and further extended version of the LIAR extension introduced by Kirilin et al. In our dataset…☆16Mar 31, 2022Updated 4 years ago
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- State-Relabeling Adversarial Active Learning☆14Aug 17, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A project from EECS6414M of Winter 2020 at York University☆11Mar 26, 2020Updated 6 years ago
- ☆14Dec 28, 2024Updated last year
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆12Sep 22, 2023Updated 2 years ago
- Codebase of https://arxiv.org/abs/2410.14923☆54Oct 22, 2024Updated last year
- ☆31Apr 14, 2026Updated 2 months ago
- ☆13Oct 21, 2021Updated 4 years ago
- ☆12Feb 21, 2022Updated 4 years ago
- Denoising Variational Autoencoder☆20Apr 26, 2018Updated 8 years ago
- ☆13Apr 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Nov 5, 2018Updated 7 years ago
- AIxCC: automated vulnerability repair via LLMs, search, and static analysis☆13Jul 16, 2024Updated last year
- ICL backdoor attack☆17Nov 4, 2024Updated last year
- deep learning, malware detection, predictive uncertainty, dataset shift, calibration, uncertainty quantification, android malware☆17Nov 30, 2021Updated 4 years ago
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- 基于MFC的文件管理器及任务管理器☆15Mar 2, 2018Updated 8 years ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆14May 24, 2024Updated 2 years ago