Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python
☆18Jan 30, 2023Updated 3 years ago
Alternatives and similar repositories for byte_pair_encoding_BPE_subword_tokenization_implementation_python
Users that are interested in byte_pair_encoding_BPE_subword_tokenization_implementation_python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Parse PDL file for the Chrome DevTools Protocol☆14Aug 12, 2019Updated 6 years ago
- WIP rust bindings for Awesomium browser☆11Jun 24, 2016Updated 9 years ago
- Snappy bindings for Rust☆16Nov 22, 2017Updated 8 years ago
- あばばばばばばばばば☆10Mar 16, 2019Updated 7 years ago
- Pretty printing for ImmutableJS☆12Jun 15, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper attend, copy, parse - End-to-end information extraction from documents (https://arxiv.org/pdf/1812.07248.pdf)☆13Jun 2, 2022Updated 4 years ago
- Safely running potentially non-terminating functions in Elm.☆10Apr 20, 2021Updated 5 years ago
- Rust library for requesting certificates from an ACME provider☆19Apr 28, 2026Updated last month
- Touch event package for Elm lang☆10Nov 26, 2018Updated 7 years ago
- ☆14Jan 20, 2023Updated 3 years ago
- ☆13Jul 17, 2021Updated 4 years ago
- Additional color handling for Elm☆13Mar 7, 2019Updated 7 years ago
- For loops in const☆13Sep 7, 2024Updated last year
- A basic Electron App using elm☆14Oct 4, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆16May 18, 2023Updated 3 years ago
- char <-> Unicode character name (maintained fork of huonw/unicode_names)☆12Jun 4, 2026Updated 2 weeks ago
- Emulated server for the Mu Online Season 6 Episode 3 client.☆15Feb 17, 2015Updated 11 years ago
- Raw rust bindings to the enet C library☆21Mar 16, 2026Updated 3 months ago
- Rust tool to get info from your lycamobile.es account☆10Apr 29, 2021Updated 5 years ago
- Alternate implementations of vector/map/set for Rust☆15Apr 27, 2023Updated 3 years ago
- char <-> Unicode character name☆25Aug 20, 2016Updated 9 years ago
- ☆12Sep 25, 2022Updated 3 years ago
- ☆10Jun 17, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆10Nov 25, 2022Updated 3 years ago
- ☆14Nov 6, 2017Updated 8 years ago
- 开放中文转换 - 简繁转换之通用规范汉字标准☆19Updated this week
- Additional basic functions for Elm.☆15Feb 23, 2023Updated 3 years ago
- 韩语输入法 RIME IME schema for typing Korean Hangul and Hanja☆12Jul 10, 2020Updated 5 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- refinement types for Elm☆16Jul 12, 2023Updated 2 years ago
- Github mirror of MediaWiki extension TextExtracts - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Dev…☆15Jun 11, 2026Updated last week
- An implementation of "A Typed, Algebraic Approach to Parsing"☆11Mar 21, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Feb 2, 2019Updated 7 years ago
- Create svg path with each point can have variable width.☆17Feb 13, 2025Updated last year
- Tools and prompt templates used to build and evaluate SWE-rebench-v2 tasks for the paper.☆63Mar 12, 2026Updated 3 months ago
- Loengfan (粵語兩分) is the Cantonese version of the Liang Fen input method☆15Mar 3, 2022Updated 4 years ago
- Write JSON decoders in Elm using continuation-style.☆16Apr 11, 2023Updated 3 years ago
- Modern and elegant test framework for Flutter, inspired by Cypress☆18May 4, 2022Updated 4 years ago
- Pluralization handling in Rust☆41Nov 4, 2022Updated 3 years ago