Targeted language identifier, based on FastText and Hunspell.
☆38 · Sep 4, 2025 · Updated 9 months ago
Alternatives and similar repositories for fastspell
Users that are interested in fastspell are comparing it to the libraries listed below.
- Tool to fix bitexts and tag near-duplicates for removal ☆35 · Sep 4, 2025 · Updated 9 months ago
- Library for fast text representation and classification. ☆31 · Jan 9, 2024 · Updated 2 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. ☆58 · Feb 3, 2026 · Updated 4 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆76 · Apr 1, 2025 · Updated last year
- [LREC 2024] 🖋 Resource and Tool for Writing System Identification ☆22 · Mar 29, 2026 · Updated 2 months ago
- Program used to split text into segments ☆28 · Oct 27, 2024 · Updated last year
- Transform TMX to text ☆27 · Nov 23, 2022 · Updated 3 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages ☆17 · Apr 14, 2026 · Updated last month
- A Grapheme to Phoneme model using LSTM implemented in pytorch ☆14 · Jul 6, 2022 · Updated 3 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB) ☆20 · Nov 23, 2024 · Updated last year
- Finite state compiler, processor and helper tools used by apertium ☆21 · May 7, 2026 · Updated last month
- fastlangid, the only language identification package that supports Cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha… ☆43 · Dec 6, 2022 · Updated 3 years ago
- Data Collection System For NLP/Speech Recognition ☆25 · Apr 20, 2021 · Updated 5 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆29 · Apr 17, 2024 · Updated 2 years ago
- ☆13 · May 29, 2026 · Updated last week
- A Sphinx theme for the CrateDB documentation. ☆22 · Jun 3, 2026 · Updated last week
- ☆11 · Oct 28, 2022 · Updated 3 years ago
- Finite-state script normalization and processing utilities ☆50 · Jun 3, 2026 · Updated last week
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023) ☆22 · Nov 1, 2023 · Updated 2 years ago
- Fast Neural Machine Translation in C++ - development repository ☆23 · May 12, 2024 · Updated 2 years ago
- ☆23 · Jan 25, 2023 · Updated 3 years ago
- ☆82 · Jan 30, 2026 · Updated 4 months ago
- An unbounded and bounded queue for concurrent access. ☆10 · Apr 27, 2022 · Updated 4 years ago
- Code for constructing TLDR corpus from Reddit dataset ☆27 · Nov 23, 2021 · Updated 4 years ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ☆25 · Oct 27, 2023 · Updated 2 years ago
- Bicleaner fork that uses neural networks ☆40 · Feb 23, 2026 · Updated 3 months ago
- Lossless normalization of uppercase characters: Go, C++ & JavaScript ☆11 · Updated this week
- A huge number library for Purescript with emphasis on correctness. ☆12 · Apr 27, 2022 · Updated 4 years ago
- Relational Scheme interpreter, written in miniKanren, with Scheme pattern matcher ☆11 · Mar 17, 2015 · Updated 11 years ago
- Game Boy Clock Accuracy Challenge ☆13 · Mar 30, 2023 · Updated 3 years ago
- data collator for UL2 and U-PaLM ☆29 · Aug 20, 2023 · Updated 2 years ago
- microKanren sagittarius/larceny ☆11 · Jun 13, 2015 · Updated 10 years ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText ☆10 · Sep 3, 2019 · Updated 6 years ago
- PureScript version management in PureScript. ☆14 · Jan 27, 2023 · Updated 3 years ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list. ☆13 · Jan 30, 2021 · Updated 5 years ago
- Crawler based on a modified browser to detect online tracking. ☆11 · Jul 19, 2023 · Updated 2 years ago
- A set of utilities for processing MediaWiki SQL dump data ☆20 · Feb 19, 2024 · Updated 2 years ago
- Demos of how to use Poetry to build various C/C++ extensions for Python. ☆25 · May 28, 2024 · Updated 2 years ago
- Hyphenation of English words ☆13 · Dec 21, 2016 · Updated 9 years ago