Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python
☆17Jan 30, 2023Updated 3 years ago
Alternatives and similar repositories for byte_pair_encoding_BPE_subword_tokenization_implementation_python
Users that are interested in byte_pair_encoding_BPE_subword_tokenization_implementation_python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Parse PDL file for the Chrome DevTools Protocol☆14Aug 12, 2019Updated 6 years ago
- WIP rust bindings for Awesomium browser☆11Jun 24, 2016Updated 9 years ago
- あばばばばばばばばば☆10Mar 16, 2019Updated 7 years ago
- ☆18Mar 26, 2015Updated 11 years ago
- Pretty printing for ImmutableJS☆12Jun 15, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for the paper attend, copy, parse - End-to-end information extraction from documents (https://arxiv.org/pdf/1812.07248.pdf)☆13Jun 2, 2022Updated 3 years ago
- Safely running potentially non-terminating functions in Elm.☆10Apr 20, 2021Updated 5 years ago
- ☆13Dec 23, 2020Updated 5 years ago
- Rust library for requesting certificates from an ACME provider☆19Apr 28, 2026Updated last week
- Configuration files☆12Jan 21, 2024Updated 2 years ago
- Simple Entity-Component System☆11Apr 21, 2015Updated 11 years ago
- A list of regular expressions for zipcodes☆12Sep 29, 2019Updated 6 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Jun 5, 2017Updated 8 years ago
- Touch event package for Elm lang☆10Nov 26, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Jan 20, 2023Updated 3 years ago
- Additional color handling for Elm☆13Mar 7, 2019Updated 7 years ago
- For loops in const☆13Sep 7, 2024Updated last year
- A basic Electron App using elm☆14Oct 4, 2017Updated 8 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆16May 18, 2023Updated 2 years ago
- char <-> Unicode character name (maintained fork of huonw/unicode_names)☆12Sep 7, 2025Updated 8 months ago
- Raw rust bindings to the enet C library☆21Mar 16, 2026Updated last month
- Plugin to setup postgresql accounts for containers deployed with Dokku☆35Sep 30, 2015Updated 10 years ago
- Rust tool to get info from your lycamobile.es account☆10Apr 29, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Alternate implementations of vector/map/set for Rust☆15Apr 27, 2023Updated 3 years ago
- dMel: Speech Tokenization Made Simple☆20May 13, 2025Updated 11 months ago
- ☆13Feb 18, 2023Updated 3 years ago
- An offline Rust thesaurus library.☆12Aug 13, 2022Updated 3 years ago
- char <-> Unicode character name☆23Aug 20, 2016Updated 9 years ago
- ☆10Jun 17, 2020Updated 5 years ago
- ☆12Sep 25, 2022Updated 3 years ago
- ☆10Nov 25, 2022Updated 3 years ago
- ☆14Nov 6, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Additional basic functions for Elm.☆15Feb 23, 2023Updated 3 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- Elixir client for the Docker Remote API☆27Jul 29, 2021Updated 4 years ago
- refinement types for Elm☆16Jul 12, 2023Updated 2 years ago
- Github mirror of MediaWiki extension TextExtracts - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Dev…☆15Updated this week
- An implementation of "A Typed, Algebraic Approach to Parsing"☆11Mar 21, 2022Updated 4 years ago
- UniParse: A universal graph-based parsing toolkit☆10Oct 2, 2019Updated 6 years ago