cohere-ai/magikarp

Code for the paper "Fishing for Magikarp"

☆191

Alternatives and similar repositories for magikarp

Users who are interested in magikarp are comparing it to the repositories listed below.

cisnlp / multypo
A Multilingual Keyboard Layout-Based Typo Generator
☆17 · Nov 23, 2025 · Updated 7 months ago
haizelabs / BEAST-implementation
☆16 · May 30, 2024 · Updated 2 years ago
dreadnode / research
General research for Dreadnode
☆28 · Jun 17, 2024 · Updated 2 years ago
sanderland / script_tok
Code for the paper "BPE stays on SCRIPT", "Which Pieces Does Unigram Tokenization Really Need?" and MinGram
☆18 · Jun 26, 2026 · Updated 3 weeks ago
adapter-hub / hgiyt
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28 · Oct 3, 2021 · Updated 4 years ago
JonasGeiping / carving
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
☆71 · Feb 22, 2024 · Updated 2 years ago
humane-intelligence / ai_village_defcon_grt_data
☆15 · Jun 7, 2024 · Updated 2 years ago
swiss-ai / parity-aware-bpe
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [ACL 2026]
☆19 · Apr 18, 2026 · Updated 3 months ago
pchizhov / picky_bpe
BPE modification that implements removing of the intermediate tokens during tokenizer training.
☆27 · Nov 25, 2024 · Updated last year
catherinearnett / morphscore
This is the repository for MorphScore, a tokenizer evaluation framework for morphological alignment.
☆17 · Jul 10, 2025 · Updated last year
cimeister / tokenizer-intrinsic-evals
TokEval: intrinsic quality metrics for tokenizers across natural language, code, and math
☆46 · Jul 4, 2026 · Updated 2 weeks ago
SheltonLiu-N / Universal-Prompt-Injection
The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".
☆73 · Oct 23, 2024 · Updated last year
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14 · Jul 11, 2023 · Updated 3 years ago
john-hewitt / embed-init
Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs
☆19 · Dec 10, 2021 · Updated 4 years ago
ckkissane / sae-transfer
Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"
☆13 · Jul 18, 2024 · Updated 2 years ago
UKPLab / maps
Multicultural Proverbs and Sayings
☆13 · Jan 11, 2025 · Updated last year
McGill-NLP / AdversarialTriggers
TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
☆19 · Aug 17, 2025 · Updated 11 months ago
dreadnode / conferences
☆18 · Apr 15, 2024 · Updated 2 years ago
ischlag / Fast-Weight-Memory-public
Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.
☆30 · Feb 25, 2021 · Updated 5 years ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆102 · Apr 2, 2024 · Updated 2 years ago
imoneoi / multipack
Multipack distributed sampler for fast padding-free training of LLMs
☆207 · Aug 10, 2024 · Updated last year
chawins / pal
PAL: Proxy-Guided Black-Box Attack on Large Language Models
☆57 · Aug 17, 2024 · Updated last year
andyzoujm / representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
☆1,012 · Aug 14, 2024 · Updated last year
dreadnode / paperstack
Arxiv + Notion Sync
☆20 · May 12, 2025 · Updated last year
PiotrNawrot / dynamic-pooling
Efficient Transformers with Dynamic Token Pooling
☆68 · May 20, 2023 · Updated 3 years ago
Zyphra / Zyda_processing
☆44 · Jun 19, 2024 · Updated 2 years ago
ctlllll / reward_collapse
☆26 · May 30, 2023 · Updated 3 years ago
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆224 · Aug 10, 2023 · Updated 2 years ago
unbiarirang / Fixed-Input-Parameterization
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32 · Sep 13, 2024 · Updated last year
AnswerDotAI / sqlite-minutils
A fork of sqlite-utils with CLI etc removed
☆17 · Jul 11, 2026 · Updated last week
vinusankars / BEAST
Implementation of BEAST adversarial attack for language models (ICML 2024)
☆89 · May 14, 2024 · Updated 2 years ago
yjw1029 / Self-Reminder-Data
Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder"
☆20 · Oct 26, 2023 · Updated 2 years ago
LLM360 / TxT360
☆25 · Dec 18, 2024 · Updated last year
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆49 · Dec 22, 2023 · Updated 2 years ago
TurkuNLP / bert-eval
☆10 · Oct 15, 2019 · Updated 6 years ago
karandwivedi42 / adversarial
Pytorch - Adversarial Training
☆26 · May 9, 2018 · Updated 8 years ago
morning9393 / ETPO
☆14 · Mar 5, 2024 · Updated 2 years ago
PythonNut / superbpe
Official code release for "SuperBPE: Space Travel for Language Models"
☆97 · May 28, 2026 · Updated last month
joaanna / disentangling_spelling_in_clip
☆36 · Jun 22, 2023 · Updated 3 years ago