Python script for manipulating the existing tokenizer.
☆21 · Mar 6, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for Tokenizer-Changer
Users that are interested in Tokenizer-Changer are comparing it to the libraries listed below.
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 · Jul 19, 2024 · Updated last year
- Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023) ☆10 · Feb 21, 2023 · Updated 3 years ago
- Repository for Sparse Universal Transformers ☆20 · Oct 23, 2023 · Updated 2 years ago
- ☆24 · Apr 3, 2025 · Updated 11 months ago
- String Distance using cython ☆13 · Jan 19, 2020 · Updated 6 years ago
- R package for phonetic research and experimenting ☆20 · Jul 29, 2024 · Updated last year
- ☆88 · Jun 1, 2023 · Updated 2 years ago
- A different, but useful, textcat approach. ☆18 · Jul 15, 2024 · Updated last year
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf ☆27 · May 25, 2024 · Updated last year
- ☆134 · Jan 22, 2026 · Updated 2 months ago
- (BMVC2021, Oral) The repository offers the official implementation of our BMVC 2021 paper (oral) in PyTorch. ☆18 · Apr 22, 2022 · Updated 3 years ago
- NOAH's Corpus: Part-of-Speech Tagging for Swiss German ☆12 · Jan 6, 2023 · Updated 3 years ago
- Python port for IWNLP.Lemmatizer ☆18 · Oct 18, 2023 · Updated 2 years ago
- Tracking battery electric car adoption by sales and market share ☆23 · Mar 8, 2026 · Updated 3 weeks ago
- Nanyang Technological University - Multilingual Corpus (STB subcorpora) ☆12 · Mar 11, 2019 · Updated 7 years ago
- Data and code to support "Applied Natural Language Processing" (INFO 256, Fall 2023, UC Berkeley) ☆17 · Nov 20, 2023 · Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆60 · Feb 12, 2025 · Updated last year
- Do Multilingual Language Models Think Better in English? ☆42 · Aug 3, 2023 · Updated 2 years ago
- ☆89 · Jan 28, 2026 · Updated 2 months ago
- Muon fsdp 2 ☆56 · Aug 8, 2025 · Updated 7 months ago
- German lemmatization with IWNLP as extension for spaCy ☆27 · Jul 28, 2023 · Updated 2 years ago
- Contrastive Chain-of-Thought Prompting ☆69 · Nov 18, 2023 · Updated 2 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition ☆31 · Aug 26, 2017 · Updated 8 years ago
- VELMA agent for VLN in Street View ☆30 · Sep 29, 2023 · Updated 2 years ago
- Abstractive Summarization IJCAI paper code ☆28 · Oct 4, 2018 · Updated 7 years ago
- [SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vect… ☆17 · Oct 5, 2024 · Updated last year
- ☆31 · Nov 23, 2022 · Updated 3 years ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)! ☆13 · Oct 22, 2023 · Updated 2 years ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback ☆43 · Mar 14, 2024 · Updated 2 years ago
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues ☆45 · May 20, 2025 · Updated 10 months ago
- a lightweight no-dependency fork from transformers.js (only tokenizers) ☆32 · Jan 21, 2026 · Updated 2 months ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo… ☆18 · Feb 19, 2026 · Updated last month
- 💾 A moleculer service mixin for minio and S3 💾 ☆15 · Sep 16, 2022 · Updated 3 years ago
- ☆105 · Mar 1, 2026 · Updated 3 weeks ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project. ☆80 · Sep 22, 2022 · Updated 3 years ago
- ☆18 · Oct 23, 2025 · Updated 5 months ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications. ☆14 · Feb 3, 2025 · Updated last year
- We introduce EfficientRAG, an efficient retriever for multi-hop question answering. EfficientRAG iteratively generates new queries withou… ☆17 · Mar 4, 2025 · Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆66 · Aug 15, 2025 · Updated 7 months ago