bhavnicksm / autotiktokenizer
🧰 The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! ✨
☆12 · Updated this week
Alternatives and similar repositories for autotiktokenizer:
Users who are interested in autotiktokenizer are comparing it to the libraries listed below.
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers ☆59 · Updated 4 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆85 · Updated 3 months ago
- Smart commit messages ☆18 · Updated last month
- A holistic evaluation library for multi-modal generative models using Weave ☆27 · Updated 3 weeks ago
- Utilities for PyTorch distributed ☆23 · Updated last year
- NLP with Rust for Python 🦀🐍 ☆59 · Updated 5 months ago
- ☆116 · Updated last month
- ☆36 · Updated 3 weeks ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆15 · Updated last month
- Generalist and Lightweight Model for Text Classification ☆51 · Updated 2 weeks ago
- 🤗 A collection of templates for Hugging Face Spaces ☆35 · Updated last year
- ☆12 · Updated 11 months ago
- [WIP] A 🔥 interface for running code in the cloud ☆86 · Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆42 · Updated 10 months ago
- ☆73 · Updated 4 months ago
- Collection of autoregressive model implementation ☆67 · Updated this week
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆46 · Updated 10 months ago
- Let's build better datasets, together! ☆209 · Updated this week
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆80 · Updated 11 months ago
- ☆69 · Updated this week
- 🤗 Trade any tensors over the network ☆30 · Updated last year
- ☆41 · Updated 3 weeks ago
- ☆45 · Updated 3 months ago
- Chunk your text using gpt4o-mini more accurately ☆42 · Updated 3 months ago
- Efficient CUDA kernels for training convolutional neural networks with PyTorch. ☆35 · Updated last week
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆31 · Updated last month
- A case study of efficient training of large language models using commodity hardware. ☆68 · Updated 2 years ago
- Tools to make language models a bit easier to use ☆30 · Updated last week
- ☆94 · Updated 2 months ago