google / cld3
☆810 Updated last year
Alternatives and similar repositories for cld3:
Users who are interested in cld3 are comparing it to the libraries listed below.
- Compact Language Detector 2 ☆851 Updated 3 years ago
- Training open neural machine translation models ☆354 Updated 6 months ago
- Bitextor generates translation memories from multilingual websites ☆291 Updated 4 months ago
- Modern spell checking library - accurate, fast, multi-language ☆630 Updated 6 months ago
- Heuristic based boilerplate removal tool ☆758 Updated 2 weeks ago
- Tools to download and cleanup Common Crawl data ☆992 Updated last year
- NeuSpell: A Neural Spelling Correction Toolkit ☆690 Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆150 Updated last year
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text ☆1,270 Updated this week
- Language-Agnostic SEntence Representations ☆3,619 Updated 10 months ago
- ☆168 Updated 9 months ago
- LASER multilingual sentence embeddings as a pip package ☆224 Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆821 Updated this week
- ☆500 Updated last year
- Fast Neural Machine Translation in C++ ☆1,300 Updated last year
- Simple, fast unsupervised word aligner ☆748 Updated 2 years ago
- Port of Google's language-detection library to Python. ☆1,765 Updated last week
- A python module for English lemmatization and inflection. ☆265 Updated last year
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆731 Updated 6 months ago
- A neural word aligner based on multilingual BERT ☆339 Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,204 Updated 5 months ago
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆413 Updated last month
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆375 Updated 3 months ago
- Python bindings for cld3 ☆27 Updated last year
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing) ☆748 Updated 6 years ago
- Python port of Moses tokenizer, truecaser and normalizer ☆490 Updated 9 months ago
- Improved Sentence Alignment in Linear Time and Space ☆167 Updated 2 years ago
- Stand-alone language identification system ☆2,365 Updated 5 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆736 Updated last week
- Facebook Low Resource (FLoRes) MT Benchmark ☆722 Updated last year