Vocabulary list of GPT-4o (o200k_base) and GPT-4/GPT-3.5 (cl100k_base) tokenizers. Special tokens are excluded.
☆68 · May 14, 2024 · Updated 2 years ago
Alternatives and similar repositories for gpt4_vocab_list
Users that are interested in gpt4_vocab_list are comparing it to the libraries listed below.
- Construction Grammar based BERT ☆14 · Dec 5, 2020 · Updated 5 years ago
- ☆16 · Apr 14, 2021 · Updated 5 years ago
- ☆15 · Aug 2, 2024 · Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv… ☆13 · Sep 19, 2024 · Updated last year
- ☆15 · Aug 7, 2025 · Updated 10 months ago
- A paper list for box embeddings ☆17 · Jun 9, 2021 · Updated 5 years ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark ☆40 · Oct 7, 2025 · Updated 8 months ago
- Implementation of "Neural Word Embedding as Implicit Matrix Factorization" ☆14 · Mar 17, 2022 · Updated 4 years ago
- Modified version of T5-DST for Dialogue State Tracking. ☆19 · Dec 10, 2021 · Updated 4 years ago
- Evaluation Pipeline for medical tasks. ☆12 · Apr 8, 2026 · Updated 2 months ago
- ☆19 · Nov 4, 2025 · Updated 7 months ago
- ☆22 · Jan 11, 2023 · Updated 3 years ago
- Trained a 114 million Parameter LLM from Scratch. ☆19 · Jul 21, 2024 · Updated last year
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots ☆12 · Dec 15, 2020 · Updated 5 years ago
- ☆12 · Aug 6, 2022 · Updated 3 years ago
- PCIe GEN1, GEN2 and GEN3 Scrambler, This Scrambler is able to scramble 1,2 and 4 bytes of data in 1 clock cycle in respect to the scrambl… ☆18 · Jul 5, 2025 · Updated 11 months ago
- Lexical semantic change detection shared task at SemEval 2020: UiO-UVA team ☆16 · Jan 10, 2023 · Updated 3 years ago
- [NDSS'25] The official implementation of safety misalignment. ☆19 · Jan 8, 2025 · Updated last year
- Help creating image dataset for machine learning. ☆10 · Nov 4, 2020 · Updated 5 years ago
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking ☆12 · Aug 22, 2025 · Updated 10 months ago
- [Neurips 2025] StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models ☆33 · Dec 4, 2025 · Updated 6 months ago
- ☆13 · Jan 9, 2022 · Updated 4 years ago
- ML algorithms implementations that are good for learning the underlying principles ☆28 · Dec 7, 2024 · Updated last year
- CS583 course project ☆14 · Dec 3, 2017 · Updated 8 years ago
- ☆10 · Jul 28, 2020 · Updated 5 years ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting ☆17 · Apr 15, 2025 · Updated last year
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models … ☆15 · Jun 14, 2026 · Updated 2 weeks ago
- A page scraping DSL for extracting structured information from unstructured XHTML, built on Node.js and jQuery ☆49 · Jan 9, 2015 · Updated 11 years ago
- yet another anki app ☆14 · Sep 9, 2024 · Updated last year
- bevy_openai is an event-driven plugin for Bevy that provides convenient access to the OpenAI API. ☆12 · Jan 28, 2024 · Updated 2 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer ☆13 · Jun 21, 2023 · Updated 3 years ago
- Code for "Improving Expert Predictions with Conformal Prediction", ICML 2023 ☆14 · Aug 20, 2024 · Updated last year
- Pandas Helper Library for reading and writing DataFrames from and to HBase. ☆10 · Mar 8, 2018 · Updated 8 years ago
- ☆15 · Apr 27, 2024 · Updated 2 years ago
- ☆13 · May 21, 2024 · Updated 2 years ago
- Easy Application Development with React JavaScript ☆16 · Sep 28, 2013 · Updated 12 years ago
- ☆17 · Dec 23, 2022 · Updated 3 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks ☆17 · Apr 24, 2024 · Updated 2 years ago
- MacBook Pro keyboard written in SwiftUI. ☆12 · Jan 19, 2021 · Updated 5 years ago