Code for the paper "Fishing for Magikarp"
☆185Apr 14, 2026Updated 2 weeks ago
Alternatives and similar repositories for magikarp
Users that are interested in magikarp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- General research for Dreadnode☆27Jun 17, 2024Updated last year
- ☆16May 30, 2024Updated last year
- ☆45Feb 11, 2026Updated 2 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- ☆15Jun 7, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆70Feb 22, 2024Updated 2 years ago
- A utility to inspect, validate, sign and verify machine learning model files.☆67Feb 5, 2025Updated last year
- Arxiv + Notion Sync☆20May 12, 2025Updated 11 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- ☆22Dec 18, 2024Updated last year
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆18Apr 15, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 8 months ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆29Feb 25, 2021Updated 5 years ago
- Language models scale reliably with over-training and on downstream tasks☆101Apr 2, 2024Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆208Aug 10, 2024Updated last year
- Efficient Transformers with Dynamic Token Pooling☆68May 20, 2023Updated 2 years ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆986Aug 14, 2024Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆56Aug 17, 2024Updated last year
- Subliminal learning in LLMs: language models can transmit hidden preferences through seemingly unrelated training data.☆21Nov 9, 2025Updated 5 months ago
- Contextualized per-token embeddings☆35May 11, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆45Jun 19, 2024Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆223Aug 10, 2023Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- A fork of sqlite-utils with CLI etc removed☆17Apr 6, 2026Updated 3 weeks ago
- ☆14Mar 5, 2024Updated 2 years ago
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆45Mar 31, 2026Updated 3 weeks ago
- Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder"☆20Oct 26, 2023Updated 2 years ago
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆88May 14, 2024Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Dec 22, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Oct 15, 2019Updated 6 years ago
- Pytorch - Adversarial Training☆25May 9, 2018Updated 7 years ago
- Code for Zero-Shot Tokenizer Transfer☆144Jan 14, 2025Updated last year
- ☆27Oct 6, 2024Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆244Nov 3, 2023Updated 2 years ago
- ☆131Dec 3, 2025Updated 4 months ago