bnosac / tokenizers.bpe
R package for Byte Pair Encoding based on YouTokenToMe
☆15Updated last year
Alternatives and similar repositories for tokenizers.bpe:
Users that are interested in tokenizers.bpe are comparing it to the libraries listed below
- Efficient learning of word representations☆22Updated 4 years ago
- A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction…☆20Updated 6 years ago
- allowing R users to work with dlib through Rcpp☆13Updated 6 years ago
- Download and cache HuggingFace Hub files☆16Updated 5 months ago
- Automate Scaffolding R Interfaces to Packages in Other Programming Languages☆27Updated last year
- R interface to Vowpal Wabbit☆22Updated 5 years ago
- Combining drake workflows with R package development to train and execute a machine learning model☆14Updated 4 years ago
- torch from R!☆51Updated 4 years ago
- ☆46Updated 6 years ago
- R package for 'Efficient Learning of Word Representations and Sentence Classification'☆42Updated last year
- R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece☆25Updated 2 years ago
- 📐Julia's implementation of word2vec in R☆24Updated 5 years ago
- Tidy BERT-like Models☆21Updated last year
- Hugging face tokenizers for R using extendr☆11Updated last year
- ☆28Updated 3 years ago
- Word Factor Vectors☆33Updated 5 years ago
- Tools for Data Manipulation in R☆16Updated 4 years ago
- This is a question-output workflow template for shiny app!☆12Updated 5 years ago
- reprex + slack☆31Updated 2 years ago
- Machine learning explanations☆22Updated 7 months ago
- Tools for Standardizing Variables for Regression in R☆23Updated 4 years ago
- Materials for the "Apps and Dashboards with Shiny " workshop at WSDS 2018☆19Updated 5 years ago
- An easier way to tidying pivoted tables.☆29Updated 4 years ago
- A small message queue for interprocess communication☆23Updated 3 years ago
- Printable Tibbles☆15Updated 6 years ago
- An R-package to build nesting or hierarchical structures☆13Updated 3 years ago
- quanteda textmodel extensions for classifying documents☆21Updated last year
- Cheap R functions to save time and memory☆20Updated this week
- Sampling Methods for Big Data☆10Updated 6 years ago
- Text Processing for Small or Big Data Files in R☆38Updated last year