Codebase for Obfuscated Activations Bypass LLM Latent-Space Defenses
☆31Feb 11, 2025Updated last year
Alternatives and similar repositories for obfuscated-activations
Users that are interested in obfuscated-activations are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python sdk for LLM finetuning and inference on runpod infrastructure☆30May 12, 2026Updated last month
- ☆15Mar 9, 2025Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆23Feb 6, 2025Updated last year
- ☆20Feb 11, 2024Updated 2 years ago
- ☆24Jul 25, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆139Jul 7, 2025Updated 11 months ago
- ☆20Nov 15, 2024Updated last year
- Jax Decompiler☆16Apr 22, 2025Updated last year
- A benchmark for mechanistic discovery of circuits in Transformers☆17Dec 15, 2024Updated last year
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- Unofficial faiss wheel builder for NVIDIA GPU☆36Apr 29, 2026Updated last month
- Distribution Preserving Backdoor Attack in Self-supervised Learning☆20Jan 27, 2024Updated 2 years ago
- Universal Robustness Evaluation Toolkit (for Evasion)☆32Sep 17, 2025Updated 8 months ago
- [NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…☆21Oct 1, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆30Sep 15, 2024Updated last year
- Measuring the situational awareness of language models☆41Feb 12, 2024Updated 2 years ago
- VulnGym: A Real-World, Project-Level Vulnerability Benchmark for White-Box Vulnerability-Hunting Agents☆159Jun 2, 2026Updated last week
- This is the repository that introduces research topics related to protecting intellectual property (IP) of AI from a data-centric perspec…☆23Oct 30, 2023Updated 2 years ago
- Repository with sample code using Apollo's suggested engineering practices☆15Dec 16, 2024Updated last year
- ☆10Mar 24, 2025Updated last year
- ☆13Aug 19, 2024Updated last year
- LLMProc: Unix-inspired runtime that treats LLMs as processes.☆34Jul 17, 2025Updated 10 months ago
- ☆24Dec 11, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆42Jul 6, 2025Updated 11 months ago
- Parallel hyperparameter tuning with JAX☆39May 21, 2026Updated 3 weeks ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)☆105Jan 20, 2025Updated last year
- ☆27Nov 9, 2022Updated 3 years ago
- ☆18Mar 30, 2025Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear"