Code for the paper "Fishing for Magikarp"
☆191Jun 19, 2026Updated last week
Alternatives and similar repositories for magikarp
Users that are interested in magikarp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- General research for Dreadnode☆27Jun 17, 2024Updated 2 years ago
- ☆16May 30, 2024Updated 2 years ago
- ☆45Feb 11, 2026Updated 4 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- ☆15Jun 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆71Feb 22, 2024Updated 2 years ago
- A utility to inspect, validate, sign and verify machine learning model files.☆67Feb 5, 2025Updated last year
- Arxiv + Notion Sync☆20May 12, 2025Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".☆72Oct 23, 2024Updated last year
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆18Apr 15, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 10 months ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆30Feb 25, 2021Updated 5 years ago
- Language models scale reliably with over-training and on downstream tasks☆101Apr 2, 2024Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆207Aug 10, 2024Updated last year
- Efficient Transformers with Dynamic Token Pooling☆68May 20, 2023Updated 3 years ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆1,010Aug 14, 2024Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆56Aug 17, 2024Updated last year
- ☆26May 30, 2023Updated 3 years ago
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Contextualized per-token embeddings☆37Updated this week
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆224Aug 10, 2023Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- A fork of sqlite-utils with CLI etc removed☆17Apr 28, 2026Updated 2 months ago
- ☆14Mar 5, 2024Updated 2 years ago
- Subliminal learning in LLMs: language models can transmit hidden preferences through seemingly unrelated training data.☆24Nov 9, 2025Updated 7 months ago
- Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder"☆20Oct 26, 2023Updated 2 years ago
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆88May 14, 2024Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆50Dec 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Oct 15, 2019Updated 6 years ago
- Pytorch - Adversarial Training☆25May 9, 2018Updated 8 years ago
- Code for Zero-Shot Tokenizer Transfer☆145Jan 14, 2025Updated last year
- ☆27Oct 6, 2024Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆243Nov 3, 2023Updated 2 years ago
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"☆40Jul 8, 2024Updated last year