Fast BPE tokenizer: simple to understand and easy to use
☆28 · Jun 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for fast_bpe_tokenizer
Users that are interested in fast_bpe_tokenizer are comparing it to the libraries listed below.
- We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e… ☆26 · Aug 31, 2022 · Updated 3 years ago
- ☆12 · Nov 19, 2022 · Updated 3 years ago
- ☆12 · Mar 31, 2020 · Updated 6 years ago
- Information Extraction related tools and models ☆10 · Mar 16, 2023 · Updated 3 years ago
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch ☆15 · Jun 21, 2023 · Updated 2 years ago
- Japanese Massive Multitask Language Understanding Benchmark ☆40 · Oct 7, 2025 · Updated 8 months ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch ☆13 · Feb 21, 2023 · Updated 3 years ago
- ☆15 · Mar 12, 2024 · Updated 2 years ago
- Improving large language models with concept-aware fine-tuning (CAFT) ☆29 · Jan 31, 2026 · Updated 4 months ago
- fastNLP reimplementation of the paper "A Novel Cascade Binary Tagging Framework for Relational Triple Extraction" ☆11 · Dec 11, 2020 · Updated 5 years ago
- Convert PDF to pages of images ☆13 · Apr 18, 2020 · Updated 6 years ago
- Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding” ☆10 · Apr 3, 2022 · Updated 4 years ago
- ☆11 · Nov 16, 2022 · Updated 3 years ago
- Practice code for the examples and exercises in 《算法竞赛进阶指南》 (Algorithm Competition Advanced Guide) ☆16 · Sep 24, 2018 · Updated 7 years ago
- ☆14 · Nov 25, 2025 · Updated 6 months ago
- mugene MML to SMF compiler. It's dead; switch to atsushieno/mugene-ng. / For mugene guide book support, go to https://github.com/atsushieno/mugene-… ☆16 · Oct 21, 2020 · Updated 5 years ago
- A rewritten version of C++ Design Patterns and Derivatives Pricing coded in Python ☆10 · Sep 16, 2019 · Updated 6 years ago
- ☆13 · Jan 6, 2023 · Updated 3 years ago
- A singing database created by 22 people each singing five children's songs. ☆15 · Aug 7, 2022 · Updated 3 years ago
- ☆11 · Sep 8, 2024 · Updated last year
- ☆15 · Dec 28, 2023 · Updated 2 years ago
- A declarative component for conditional rendering. ☆14 · Apr 21, 2026 · Updated last month
- Midi2PLAY is an application that helps the process of converting MIDI files (.mid) making them compatible with the syntax accepted by the… ☆10 · Dec 30, 2021 · Updated 4 years ago
- Undergraduate thesis, source code, and related materials ☆15 · Dec 30, 2019 · Updated 6 years ago
- Native macOS PII removal software with on-device Vision OCR and OpenAI privacy filter model ☆180 · May 5, 2026 · Updated last month
- ☆18 · May 28, 2021 · Updated 5 years ago
- Bioinfo Training Program @ Lu Lab ☆15 · Apr 17, 2019 · Updated 7 years ago
- Code for the arXiv preprint "MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts" ☆24 · Nov 29, 2023 · Updated 2 years ago
- Examples of assembly programming for MS-DOS ☆16 · Jan 2, 2021 · Updated 5 years ago
- The LaTeX template for BNU thesis, revised based on http://gerry.lamost.org/blog/?p=811 ☆10 · Mar 13, 2019 · Updated 7 years ago
- ☆14 · Nov 28, 2022 · Updated 3 years ago
- ☆14 · Jan 10, 2024 · Updated 2 years ago
- A collaborative workspace where humans and AI agents work together in shared channels ☆201 · May 7, 2026 · Updated last month
- (Moved to Codeberg!) Retro graphics programming in 16bit style - using a modern tool-chain ☆16 · Jul 23, 2025 · Updated 10 months ago
- ☆26 · Jun 10, 2025 · Updated 11 months ago
- ☆15 · Jul 23, 2020 · Updated 5 years ago
- Corpus for Universal Conceptual Cognitive Annotation ☆13 · Mar 5, 2021 · Updated 5 years ago
- Changelog manipulation for Dart ☆14 · Dec 18, 2024 · Updated last year
- List of papers on Self-Correction of LLMs. ☆81 · May 19, 2026 · Updated 3 weeks ago