Lightweight piece tokenization library
☆12Apr 15, 2024Updated 2 years ago
Alternatives and similar repositories for curated-tokenizers
Users that are interested in curated-tokenizers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wrapper for the macOS signpost API☆17Apr 24, 2023Updated 3 years ago
- Modular Rust transformer/LLM library using Candle☆38May 5, 2024Updated last year
- Central hub for demos, code snippets, and other assets for Azure Cosmos DB for AI apps.☆13Apr 9, 2025Updated last year
- Generate a SQLite database from Wikipedia & Wikidata dumps.☆37Mar 27, 2024Updated 2 years ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- ☆17Jan 5, 2023Updated 3 years ago
- Fine-grained sentiment annotations of NoReC☆20Aug 1, 2022Updated 3 years ago
- Plug-and-play document AI with zero-shot models.☆124Feb 16, 2026Updated 2 months ago
- Experimental plugin to add support for RSS and JSON feeds to TiddlyWiki☆10Jan 9, 2022Updated 4 years ago
- Read and modify constituency trees in Rust.☆10May 5, 2020Updated 5 years ago
- A conda-smithy repository for spacy.☆14Apr 23, 2026Updated last week
- Benchmark Datasets for BioNLP Tasks☆17May 7, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Oct 16, 2019Updated 6 years ago
- a settings tool for changing css properties and variables☆14Mar 6, 2018Updated 8 years ago
- KenLM extension for spaCy 2.0.☆16Dec 6, 2017Updated 8 years ago
- Julia interface for SpaCy NLP library☆14Apr 22, 2018Updated 8 years ago
- ☆10Oct 27, 2022Updated 3 years ago
- Prodigy thing(z)☆12Mar 22, 2018Updated 8 years ago
- Jekyll skeleton theme for a personal blog☆12May 26, 2016Updated 9 years ago
- Massive Wiki - wikis made of Markdown Shared Versioned Files☆14Mar 18, 2026Updated last month
- A pre-commit hook for Pyrefly.☆24Apr 21, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Markdown extension to expand directives to include source example files to also include their variants. Only useful to tiangolo's projets…☆16Apr 17, 2026Updated 2 weeks ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆101Dec 26, 2024Updated last year
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Jan 31, 2018Updated 8 years ago
- A Prosody XMPP plug and play server☆11Apr 25, 2024Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- ☆29Nov 18, 2025Updated 5 months ago
- CLI to manage internationalizing your Titanium app☆24Aug 13, 2025Updated 8 months ago
- Confection: the sweetest config system for Python☆193Mar 27, 2026Updated last month
- Citar HMM part-of-speech tagger☆15Aug 29, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Python wrapper for the bioRxiv API.☆11Aug 18, 2021Updated 4 years ago
- ☆12Apr 12, 2024Updated 2 years ago
- Kernel sources for https://huggingface.co/kernels-community☆107Updated this week
- framework-wizio-pico☆14Oct 22, 2022Updated 3 years ago
- ☆15May 8, 2019Updated 6 years ago
- automatically generates your project's coverage badge using the shields.io service, and then updates your README☆12Updated this week
- Template for Python-based data science projects in the Alexandra Institute.☆12Updated this week