bsolomon1124/pycld3

Python3 bindings for the Compact Language Detector v3 (CLD3)

☆154

Alternatives and similar repositories for pycld3

Users that are interested in pycld3 are comparing it to the libraries listed below.

aboSamoor / pycld2
☆179 · Mar 28, 2025 · Updated last year
Mimino666 / langdetect
Port of Google's language-detection library to Python.
☆1,897 · Mar 3, 2025 · Updated last year
thomasthiebaud / spacy-fastlang
Language detection using Spacy and Fasttext
☆54 · Dec 17, 2023 · Updated 2 years ago
bsolomon1124 / demoji
Accurately find/replace/remove emojis in text strings
☆163 · Jul 12, 2026 · Updated 2 weeks ago
UB-Mannheim / spacyopentapioca
A spaCy wrapper of OpenTapioca for named entity linking on Wikidata
☆96 · Feb 5, 2026 · Updated 5 months ago
nikkonrom / noshazambot
Telegram music-guess bot with self-extension music database
☆10 · Dec 8, 2022 · Updated 3 years ago
naver-ai / MetricMT
The official code repository for MetricMT - a reward optimization method for NMT with learned metrics
☆25 · Apr 24, 2021 · Updated 5 years ago
pemistahl / lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,766 · Updated this week
saffsd / langid.py
Stand-alone language identification system
☆2,464 · Jan 1, 2020 · Updated 6 years ago
hybridtheory / floc-simhash
A fast python implementation of the SimHash algorithm.
☆27 · Oct 27, 2021 · Updated 4 years ago
GoFigure-LANL / VisHash
Visual Hash for matching copies of visually similar images.
☆16 · Mar 17, 2025 · Updated last year
harish-kamath / rqae
Residual Quantization Autoencoder, used for interpreting LLMs
☆14 · Jan 1, 2025 · Updated last year
kabirkhan / recon
Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …
☆104 · Feb 26, 2024 · Updated 2 years ago
lyuchenyang / Document-level-Sentiment-Analysis-with-User-and-Product-Context
Code for COLING 2020 paper "Improving Document-level Sentiment Analysis with User and Product Context"
☆11 · Apr 13, 2022 · Updated 4 years ago
cocaer / goNLP
NLP module for Golang
☆13 · Jul 19, 2017 · Updated 9 years ago
zelandiya / maui-standalone
☆21 · May 31, 2018 · Updated 8 years ago
dkpro / dkpro-c4corpus
DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…
☆53 · Jun 12, 2020 · Updated 6 years ago
adbar / simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
☆210 · Updated this week
tamuhey / textspan
Text span utilities for Rust and Python
☆23 · Jan 3, 2023 · Updated 3 years ago
neuml / staticvectors
🔢 Work with static vector models
☆39 · Apr 21, 2025 · Updated last year
zafercavdar / fasttext-langdetect
80x faster and 95% accurate language identification with Fasttext
☆171 · May 26, 2026 · Updated 2 months ago
HLasse / TextDescriptives
A Python library for calculating a large variety of metrics from text
☆366 · May 5, 2026 · Updated 2 months ago
Gawaboumga / iso-20275-python
ISO 20275
☆10 · Jun 12, 2026 · Updated last month
flowdegree / swarmapp-api
An NPM package to communicate with Swarmapp (foursquare) API
☆14 · Dec 2, 2024 · Updated last year
aboSamoor / polyglot
Multilingual text (NLP) processing toolkit
☆2,364 · Nov 10, 2023 · Updated 2 years ago
paracrawl / extractor
☆24 · Nov 29, 2017 · Updated 8 years ago
allenai / s2_fos
☆34 · Jan 2, 2024 · Updated 2 years ago
alexnorton / overtyper
Experiment in automatic insertion of timed transcript corrections
☆21 · Oct 31, 2017 · Updated 8 years ago
iesl / CSFCube
A Test Collection of Computer Science Papers for Faceted Query by Example
☆23 · Nov 28, 2021 · Updated 4 years ago
yannvgn / laserembeddings
LASER multilingual sentence embeddings as a pip package
☆225 · Aug 11, 2023 · Updated 2 years ago
AmenRa / indxr
A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.
☆23 · Nov 9, 2025 · Updated 8 months ago
softcite / softcite_kb
A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources
☆18 · May 14, 2023 · Updated 3 years ago
zwhe99 / FeedbackMT
Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"
☆22 · Jun 28, 2024 · Updated 2 years ago
hiredscorelabs / seqtolang
Multi-Language Identification
☆28 · Jul 25, 2024 · Updated 2 years ago
jtushman / dict_digger
Digs into Dicts (lists and tuples)
☆15 · Jun 23, 2015 · Updated 11 years ago
lisasiyu / Cross-Align
EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"
☆20 · Feb 19, 2023 · Updated 3 years ago
explosion / spacy-transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
☆1,408 · Mar 27, 2026 · Updated 3 months ago
creafz / kaggle-carvana
Solution for the Carvana Image Masking Challenge on Kaggle. It uses a custom version of RefineNet with Squeeze-and-Excitation modules imp…
☆10 · Oct 22, 2017 · Updated 8 years ago
indix / whatthelang
Lightning Fast Language Prediction 🚀
☆168 · Aug 22, 2025 · Updated 11 months ago