Code for the paper "Fishing for Magikarp"
☆186 · May 11, 2026 · Updated last week
Alternatives and similar repositories for magikarp
Users that are interested in magikarp are comparing it to the libraries listed below.
- General research for Dreadnode ☆27 · Jun 17, 2024 · Updated last year
- ☆16 · May 30, 2024 · Updated last year
- ☆45 · Feb 11, 2026 · Updated 3 months ago
- ☆15 · Jun 7, 2024 · Updated last year
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives ☆71 · Feb 22, 2024 · Updated 2 years ago
- A utility to inspect, validate, sign and verify machine learning model files. ☆67 · Feb 5, 2025 · Updated last year
- Arxiv + Notion Sync ☆20 · May 12, 2025 · Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models". ☆71 · Oct 23, 2024 · Updated last year
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023] ☆14 · Jul 11, 2023 · Updated 2 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs ☆19 · Dec 10, 2021 · Updated 4 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 · Jul 18, 2024 · Updated last year
- ☆18 · Apr 15, 2024 · Updated 2 years ago
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models ☆19 · Aug 17, 2025 · Updated 9 months ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al. ☆29 · Feb 25, 2021 · Updated 5 years ago
- Language models scale reliably with over-training and on downstream tasks ☆101 · Apr 2, 2024 · Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆207 · Aug 10, 2024 · Updated last year
- Efficient Transformers with Dynamic Token Pooling ☆68 · May 20, 2023 · Updated 3 years ago
- Representation Engineering: A Top-Down Approach to AI Transparency ☆994 · Aug 14, 2024 · Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models ☆56 · Aug 17, 2024 · Updated last year
- ☆26 · May 30, 2023 · Updated 2 years ago
- Multicultural Proverbs and Sayings ☆13 · Jan 11, 2025 · Updated last year
- ☆45 · Jun 19, 2024 · Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆223 · Aug 10, 2023 · Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs" ☆32 · Sep 13, 2024 · Updated last year
- ☆44 · Dec 28, 2022 · Updated 3 years ago
- A fork of sqlite-utils with CLI etc removed ☆17 · Apr 28, 2026 · Updated 3 weeks ago
- Subliminal learning in LLMs: language models can transmit hidden preferences through seemingly unrelated training data. ☆24 · Nov 9, 2025 · Updated 6 months ago
- Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" ☆20 · Oct 26, 2023 · Updated 2 years ago
- Implementation of BEAST adversarial attack for language models (ICML 2024) ☆88 · May 14, 2024 · Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆49 · Dec 22, 2023 · Updated 2 years ago
- ☆10 · Oct 15, 2019 · Updated 6 years ago
- Pytorch - Adversarial Training ☆25 · May 9, 2018 · Updated 8 years ago
- Code for Zero-Shot Tokenizer Transfer ☆144 · Jan 14, 2025 · Updated last year
- ☆27 · Oct 6, 2024 · Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting ☆17 · Apr 15, 2025 · Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆243 · Nov 3, 2023 · Updated 2 years ago
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts" ☆40 · Jul 8, 2024 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆97 · Feb 9, 2023 · Updated 3 years ago