javirandor / anthropic-tokenizer

Approximation of the Claude 3 tokenizer by inspecting the generation stream

☆131

Alternatives and similar repositories for anthropic-tokenizer

Users who are interested in anthropic-tokenizer are comparing it to the libraries listed below.

teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222 Updated last year
Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆242 Updated 5 months ago
kenshin9000 / ConceptARC-Representations
This repository explains and provides examples for "concept anchoring" in GPT4.
☆72 Updated last year
haizelabs / sphynx
Sphynx Hallucination Induction
☆53 Updated 6 months ago
teknium1 / ShareGPT-Builder
☆116 Updated 7 months ago
SpellcraftAI / oaib
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
☆99 Updated last year
QuixiAI / OpenChatML
☆157 Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151 Updated 10 months ago
reactorsh / ambrosia
clean up your LLM datasets
☆115 Updated 2 years ago
haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆95 Updated 3 months ago
GoodAI / goodai-ltm-benchmark
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…
☆76 Updated 7 months ago
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232 Updated 9 months ago
normal-computing / extended-mind-transformers
☆123 Updated last year
aidanmclaughlin / AidanBench
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆307 Updated last month
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177 Updated 3 weeks ago
angelina-yang / Claude_API_Contest
Claude API Test Project
☆87 Updated last year
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆156 Updated 2 years ago
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆102 Updated last year
deployradiant / pychatml
Chat Markup Language conversation library
☆55 Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 5 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140 Updated 5 months ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82 Updated last year
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆203 Updated 5 months ago
joshuacnf / Ctrl-G
☆88 Updated 7 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105 Updated 7 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175 Updated last year
teknium1 / transformers-gptq-quant
☆47 Updated last year
catid / self-discover
Implementation of Google's SELF-DISCOVER
☆298 Updated last year