Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python
☆18Jan 30, 2023Updated 3 years ago
Alternatives and similar repositories for byte_pair_encoding_BPE_subword_tokenization_implementation_python
Users that are interested in byte_pair_encoding_BPE_subword_tokenization_implementation_python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 8 months ago
- あばばばばばばばばば☆10Mar 16, 2019Updated 7 years ago
- Pretty printing for ImmutableJS☆12Jun 15, 2016Updated 9 years ago
- Safely running potentially non-terminating functions in Elm.☆10Apr 20, 2021Updated 4 years ago
- Configuration files☆12Jan 21, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A list of regular expressions for zipcodes☆12Sep 29, 2019Updated 6 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Jun 5, 2017Updated 8 years ago
- Elm Set built on top of AnyDict☆10Aug 12, 2024Updated last year
- Touch event package for Elm lang☆10Nov 26, 2018Updated 7 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- ☆13Jul 17, 2021Updated 4 years ago
- Additional color handling for Elm☆13Mar 7, 2019Updated 7 years ago
- A basic Electron App using elm☆14Oct 4, 2017Updated 8 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆16May 18, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- char <-> Unicode character name (maintained fork of huonw/unicode_names)☆12Sep 7, 2025Updated 6 months ago
- Raw rust bindings to the enet C library☆21Mar 16, 2026Updated last week
- A calculator written in the elm language☆13Oct 31, 2022Updated 3 years ago
- Rust tool to get info from your lycamobile.es account☆10Apr 29, 2021Updated 4 years ago
- Alternate implementations of vector/map/set for Rust☆15Apr 27, 2023Updated 2 years ago
- A demo project for using `emulators` to generate screenshots for a Flutter project☆13Feb 24, 2022Updated 4 years ago
- ☆13Feb 18, 2023Updated 3 years ago
- An offline Rust thesaurus library.☆12Aug 13, 2022Updated 3 years ago
- char <-> Unicode character name☆23Aug 20, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Jun 17, 2020Updated 5 years ago
- ☆12Sep 25, 2022Updated 3 years ago
- ☆14Nov 6, 2017Updated 8 years ago
- Additional basic functions for Elm.☆15Feb 23, 2023Updated 3 years ago
- 韩语输入法 RIME IME schema for typing Korean Hangul and Hanja☆13Jul 10, 2020Updated 5 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- refinement types for Elm☆16Jul 12, 2023Updated 2 years ago
- Github mirror of MediaWiki extension TextExtracts - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Dev…☆15Updated this week
- An implementation of "A Typed, Algebraic Approach to Parsing"☆11Mar 21, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆15Feb 2, 2019Updated 7 years ago
- UniParse: A universal graph-based parsing toolkit☆10Oct 2, 2019Updated 6 years ago
- Loengfan (粵語兩分) is the Cantonese version of the Liang Fen input method☆15Mar 3, 2022Updated 4 years ago
- Write JSON decoders in Elm using continuation-style.☆16Apr 11, 2023Updated 2 years ago
- Modern and elegant test framework for Flutter, inspired by Cypress☆18May 4, 2022Updated 3 years ago
- A dropdown component for Elm☆12Jan 28, 2019Updated 7 years ago
- Pluralization handling in Rust☆41Nov 4, 2022Updated 3 years ago