DolbyUUU / byte_pair_encoding_BPE_subword_tokenization_implementation_pythonView external linksLinks
Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python
☆18Jan 30, 2023Updated 3 years ago
Alternatives and similar repositories for byte_pair_encoding_BPE_subword_tokenization_implementation_python
Users that are interested in byte_pair_encoding_BPE_subword_tokenization_implementation_python are comparing it to the libraries listed below
Sorting:
- ☆10Jan 20, 2023Updated 3 years ago
- ☆11Sep 25, 2022Updated 3 years ago
- For loops in const☆13Sep 7, 2024Updated last year
- Github mirror of MediaWiki extension TextExtracts - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Dev…☆15Feb 5, 2026Updated last week
- ☆10Jun 17, 2020Updated 5 years ago
- ☆13Feb 18, 2023Updated 2 years ago
- Elm Set built on top of AnyDict☆10Aug 12, 2024Updated last year
- 开放中文转换 - 简繁转换之通用规范汉字标准☆13Jan 27, 2026Updated 2 weeks ago
- dMel: Speech Tokenization Made Simple☆16May 13, 2025Updated 9 months ago
- char <-> Unicode character name (maintained fork of huonw/unicode_names)☆12Sep 7, 2025Updated 5 months ago
- Touch event package for Elm lang☆10Nov 26, 2018Updated 7 years ago
- A Flutter package containing highly customizable island-style buttons known as Chiclet.☆13Apr 22, 2024Updated last year
- Course asset for the VR Developer Nanodegree > VR Scenes & Objects > Game Objects lesson☆11Jun 28, 2022Updated 3 years ago
- Rust tool to get info from your lycamobile.es account☆10Apr 29, 2021Updated 4 years ago
- Safely running potentially non-terminating functions in Elm.☆10Apr 20, 2021Updated 4 years ago
- ☆13Dec 23, 2020Updated 5 years ago
- Guetzli Windows, Guetzli Batch, Guetzli Frontend Windows, Guetzli Gui Windows, for the program Guetzli Perceptual JPEG encoder https://g…☆11May 15, 2020Updated 5 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- A one-file script which uses FontForge to convert a folder of svg files to a ttf font.☆14Jul 6, 2024Updated last year
- BOSshell for the TI-84+CE graphing calculator☆10Jul 31, 2020Updated 5 years ago
- Additional color handling for Elm☆13Mar 7, 2019Updated 6 years ago
- a flutter plugin for pocketsphinx on Android and iOS☆10Jan 12, 2019Updated 7 years ago
- ☆14Feb 2, 2018Updated 8 years ago
- A calculator written in the elm language☆13Oct 31, 2022Updated 3 years ago
- A basic Electron App using elm☆14Oct 4, 2017Updated 8 years ago
- An implementation of "A Typed, Algebraic Approach to Parsing"☆11Mar 21, 2022Updated 3 years ago
- Vscode extension to add ez80 assembly support☆11Jun 20, 2021Updated 4 years ago
- あばばばばばばばばば☆10Mar 16, 2019Updated 6 years ago
- WIP rust bindings for Awesomium browser☆11Jun 24, 2016Updated 9 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Jun 5, 2017Updated 8 years ago
- Write JSON decoders in Elm using continuation-style.☆16Apr 11, 2023Updated 2 years ago
- Simple Entity-Component System☆11Apr 21, 2015Updated 10 years ago
- Full functional Chinese Input Method for Android, using HeChinese coding system.☆11Aug 14, 2016Updated 9 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- UniParse: A universal graph-based parsing toolkit☆10Oct 2, 2019Updated 6 years ago
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago
- Zilog eZ80, Z80 and Intel 8080 emulator library for Rust that passes all ZEXALL tests☆13Jan 9, 2026Updated last month
- List of Stock Symbols with CUSIP ID number☆19Aug 12, 2025Updated 6 months ago
- An offline Rust thesaurus library.☆12Aug 13, 2022Updated 3 years ago