Python3 bindings for the Compact Language Detector v3 (CLD3)
โ154Apr 19, 2026Updated last month
Alternatives and similar repositories for pycld3
Users that are interested in pycld3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- โ179Mar 28, 2025Updated last year
- [WWW 2026] ๐ธ GlotWeb: Web Indexing for Minority Languagesโ17Apr 14, 2026Updated last month
- Port of Google's language-detection library to Python.โ1,885Mar 3, 2025Updated last year
- Language detection using Spacy and Fasttextโ54Dec 17, 2023Updated 2 years ago
- Accurately find/replace/remove emojis in text stringsโ163Apr 26, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer โข AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataโ95Feb 5, 2026Updated 3 months ago
- The official code repository for MetricMT - a reward optimization method for NMT with learned metricsโ25Apr 24, 2021Updated 5 years ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language textโ1,725Apr 23, 2026Updated last month
- Record my paper reading about Machine Translation and other related works.โ36Nov 19, 2021Updated 4 years ago
- Stand-alone language identification systemโ2,459Jan 1, 2020Updated 6 years ago
- Visual Hash for matching copies of visually similar images.โ16Mar 17, 2025Updated last year
- A fast python implementation of the SimHash algorithm.โ27Oct 27, 2021Updated 4 years ago
- Code for COLING 2020 paper "Improving Document-level Sentiment Analysis with User and Product Context"โ11Apr 13, 2022Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality โฆโ104Feb 26, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer โข AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This project provides lists of various ISO standards (e.g. country, language, language scripts, and currency names) in one placeโ17Mar 27, 2026Updated 2 months ago
- โ21May 31, 2018Updated 7 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyโ201Jun 6, 2025Updated 11 months ago
- Text span utilities for Rust and Pythonโ23Jan 3, 2023Updated 3 years ago
- 80x faster and 95% accurate language identification with Fasttextโ168Jan 23, 2024Updated 2 years ago
- A Python library for calculating a large variety of metrics from textโ364May 5, 2026Updated 3 weeks ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.โ39Feb 5, 2026Updated 3 months ago
- Multilingual text (NLP) processing toolkitโ2,367Nov 10, 2023Updated 2 years ago
- โ23Aug 10, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Converts HTTrack crawls to WARC filesโ34Aug 6, 2024Updated last year
- โ24Nov 29, 2017Updated 8 years ago
- A Python wrapper for the ROUGE summarization evaluation packageโ14Aug 9, 2017Updated 8 years ago
- Converts Twitter threads to Markdown files with proper reply indentation.โ11Dec 8, 2022Updated 3 years ago
- โ11Jul 22, 2018Updated 7 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Exampleโ23Nov 28, 2021Updated 4 years ago
- โ34Jan 2, 2024Updated 2 years ago
- Experiment in automatic insertion of timed transcript correctionsโ21Oct 31, 2017Updated 8 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.โ261Apr 15, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer โข AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.โ23Nov 9, 2025Updated 6 months ago
- A C++/CUDA toolkit for neural machine translation (RNN-Based NMT) across multiple GPUsโ10Oct 17, 2022Updated 3 years ago
- ๐ธ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyโ1,407Mar 27, 2026Updated last month
- Lightning Fast Language Prediction ๐โ168Aug 22, 2025Updated 9 months ago
- Experiment on metadata extraction using large language models such as GPT-3โ12Feb 1, 2023Updated 3 years ago
- Language-Agnostic SEntence Representationsโ3,662May 2, 2024Updated 2 years ago
- ๐งช Cutting-edge experimental spaCy components and featuresโ105Apr 23, 2024Updated 2 years ago