Targetted language identifier, based on FastText and Hunspell.
☆38Sep 4, 2025Updated 7 months ago
Alternatives and similar repositories for fastspell
Users that are interested in fastspell are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 7 months ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- Tool for manual evaluation of parallel sentences.☆15Jan 26, 2026Updated 2 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 2 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆75Apr 1, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Program used to split text into segments☆28Oct 27, 2024Updated last year
- Transform TMX to text☆28Nov 23, 2022Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- Natural language detection, Java bindings for CLD2☆17Feb 26, 2026Updated last month
- fasttext with wheels and no external dependency, but only the predict method (<1MB)☆19Nov 23, 2024Updated last year
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆43Dec 6, 2022Updated 3 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- Extracts plain text, language identification and more metadata from WARC records☆23Oct 1, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Material for a course on Advanced NLP☆15Jul 22, 2025Updated 8 months ago
- Data Collection System For NLP/Speech Recognition☆25Apr 20, 2021Updated 4 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated last year
- The source code for the TIRA Shared Task Platform☆17Updated this week
- A Sphinx theme for the CrateDB documentation.☆22Apr 3, 2026Updated last week
- Stuttgart Finite State Transducer system☆25Aug 9, 2025Updated 8 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Finite-state script normalization and processing utilities☆47Mar 31, 2026Updated last week
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A small wrapper around python logging module which can easily format and write logs to file.☆12Jan 9, 2023Updated 3 years ago
- ☆82Jan 30, 2026Updated 2 months ago
- An unbounded and bounded queue for concurrent access.☆10Apr 27, 2022Updated 3 years ago
- Data type isomorphic to α ∨ β ∨ (α ∧ β)☆14Apr 27, 2022Updated 3 years ago
- Micro-framework for publishing linked data☆11Aug 1, 2017Updated 8 years ago
- Bicleaner fork that uses neural networks☆40Feb 23, 2026Updated last month
- Relational Scheme interpreter, written in miniKanren, with Scheme pattern matcher☆11Mar 17, 2015Updated 11 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Jun 26, 2023Updated 2 years ago
- Formulaire en ligne qui génère une attestation de déplacement dérogatoire☆10Mar 18, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- microKanren sagittarius/larceny☆11Jun 13, 2015Updated 10 years ago
- PureScript Erlang hello world☆13Aug 3, 2018Updated 7 years ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list.☆13Jan 30, 2021Updated 5 years ago
- Crawler based on a modified browser to detect online tracking.☆11Jul 19, 2023Updated 2 years ago
- Quickly estimate the similarity between many sets☆53Dec 3, 2022Updated 3 years ago
- WProofreader software development kit (SDK) offers multilingual spelling & grammar check API and JavaScript libraries for rich text edito…☆13Mar 30, 2026Updated last week
- Rule-based Kurdish Transliterator☆10May 3, 2024Updated last year