Python3 bindings for the Compact Language Detector v3 (CLD3)
★154 · Apr 19, 2026 · Updated last month
Alternatives and similar repositories for pycld3
Users that are interested in pycld3 are comparing it to the libraries listed below.
- ★179 · Mar 28, 2025 · Updated last year
- [WWW 2026] GlotWeb: Web Indexing for Minority Languages ★17 · Apr 14, 2026 · Updated 2 months ago
- Port of Google's language-detection library to Python. ★1,890 · Mar 3, 2025 · Updated last year
- Language detection using Spacy and Fasttext ★54 · Dec 17, 2023 · Updated 2 years ago
- Targeted language identifier, based on FastText and Hunspell. ★38 · Sep 4, 2025 · Updated 9 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ★96 · Feb 5, 2026 · Updated 4 months ago
- Yet Another Z39.50-powered Chatbot ★13 · Oct 9, 2023 · Updated 2 years ago
- The official code repository for MetricMT - a reward optimization method for NMT with learned metrics ★25 · Apr 24, 2021 · Updated 5 years ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text ★1,733 · Apr 23, 2026 · Updated last month
- Record my paper reading about Machine Translation and other related works. ★36 · Nov 19, 2021 · Updated 4 years ago
- Stand-alone language identification system ★2,460 · Jan 1, 2020 · Updated 6 years ago
- Visual Hash for matching copies of visually similar images. ★16 · Mar 17, 2025 · Updated last year
- A fast python implementation of the SimHash algorithm. ★27 · Oct 27, 2021 · Updated 4 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ★14 · Jan 1, 2025 · Updated last year
- An example of graph embeddings for wikipedia page recommendations ★11 · Aug 26, 2021 · Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★104 · Feb 26, 2024 · Updated 2 years ago
- Gem to allow easy access to data from the WIPO PATENTSCOPE Web Service ★18 · Updated this week
- ★18 · Jan 26, 2023 · Updated 3 years ago
- ★21 · May 31, 2018 · Updated 8 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ★203 · Updated this week
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ★53 · Jun 12, 2020 · Updated 6 years ago
- Text span utilities for Rust and Python ★23 · Jan 3, 2023 · Updated 3 years ago
- 80x faster and 95% accurate language identification with Fasttext ★168 · May 26, 2026 · Updated 3 weeks ago
- A Python library for calculating a large variety of metrics from text ★366 · May 5, 2026 · Updated last month
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ★39 · Feb 5, 2026 · Updated 4 months ago
- ISO 20275 ★10 · Updated this week
- Multilingual text (NLP) processing toolkit ★2,366 · Nov 10, 2023 · Updated 2 years ago
- Converts HTTrack crawls to WARC files ★34 · Aug 6, 2024 · Updated last year
- ★24 · Nov 29, 2017 · Updated 8 years ago
- A Python wrapper for the ROUGE summarization evaluation package ★14 · Aug 9, 2017 · Updated 8 years ago
- Converts Twitter threads to Markdown files with proper reply indentation. ★11 · Dec 8, 2022 · Updated 3 years ago
- ★11 · Jul 22, 2018 · Updated 7 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example ★23 · Nov 28, 2021 · Updated 4 years ago
- ★34 · Jan 2, 2024 · Updated 2 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums. ★264 · Updated this week
- LASER multilingual sentence embeddings as a pip package ★225 · Aug 11, 2023 · Updated 2 years ago
- A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024. ★23 · Nov 9, 2025 · Updated 7 months ago
- A C++/CUDA toolkit for neural machine translation (RNN-Based NMT) across multiple GPUs ★10 · Oct 17, 2022 · Updated 3 years ago
- Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ★1,406 · Mar 27, 2026 · Updated 2 months ago