Vocabulary list of GPT-4o (o200k_base) and GPT-4/GPT-3.5 (cl100k_base) tokenizers. Special tokens are excluded.
☆68May 14, 2024Updated 2 years ago
Alternatives and similar repositories for gpt4_vocab_list
Users that are interested in gpt4_vocab_list are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Metadata browser of TREC☆10May 19, 2026Updated 3 weeks ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- ☆11Nov 11, 2024Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆40Oct 7, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 2 months ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆60Oct 1, 2025Updated 8 months ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- This repository contains codes for *Sem 2023 paper “Generative Data Augmentation for Aspect Sentiment Quad Prediction”.☆10May 30, 2023Updated 3 years ago
- pytorch实现bert做seq2seq任务 ,使用unilm方案。☆10Apr 1, 2020Updated 6 years ago
- [Neurips 2025]StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models☆32Dec 4, 2025Updated 6 months ago
- ☆10Dec 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- A page scraping DSL for extracting structured information from unstructured XHTML, built on Node.js and jQuery☆49Jan 9, 2015Updated 11 years ago
- Official Code for Efficient and Effective Augmentation Strategy for Adversarial Training (NeurIPS-2022)☆17Mar 29, 2023Updated 3 years ago
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆96May 6, 2026Updated last month
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- UI for ActivityWatch. Include category editor and viewer for multiple categorizations.☆10Jan 31, 2024Updated 2 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase.☆10Mar 8, 2018Updated 8 years ago
- ☆16Apr 27, 2024Updated 2 years ago
- Library for soft prompt tuning☆22Jun 2, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Dec 23, 2022Updated 3 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆17Apr 24, 2024Updated 2 years ago
- A patient matching test harness to support PCOR☆16Feb 28, 2017Updated 9 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- ☆22May 27, 2020Updated 6 years ago
- Enemies for your LLM☆37Jan 20, 2026Updated 4 months ago
- reddit's python experiments framework☆12Apr 28, 2025Updated last year
- A generic GLSL post-processing module for applying super-speedy GPU effects to img/video/canvas elements.☆27Aug 20, 2017Updated 8 years ago
- Atom Editor plugin for tidalcycles(Since tidal version 0.8, the offical atom plugin atom-tidalcycles should be used)☆14Mar 14, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"☆15Jun 21, 2024Updated last year
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models☆23Oct 22, 2024Updated last year
- ☆139Jul 7, 2025Updated 11 months ago
- Add & update simple literature notes from Zotero.☆25May 21, 2026Updated 2 weeks ago
- inductive reasoning benchmark with subregular hierarchy for string-to-string transformation☆20Jun 27, 2025Updated 11 months ago
- some config files☆14Feb 23, 2026Updated 3 months ago