Port of Google's language-detection library to Python.
☆1,890Mar 3, 2025Updated last year
Alternatives and similar repositories for langdetect
Users that are interested in langdetect are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stand-alone language identification system☆2,460Jan 1, 2020Updated 6 years ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)☆766Feb 25, 2019Updated 7 years ago
- Multilingual text (NLP) processing toolkit☆2,366Nov 10, 2023Updated 2 years ago
- ☆179Mar 28, 2025Updated last year
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,733Apr 23, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Apr 19, 2026Updated last month
- Library for fast text representation and classification.☆26,529Mar 22, 2024Updated 2 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,652May 19, 2026Updated 3 weeks ago
- Topic Modelling for Humans☆16,432Nov 1, 2025Updated 7 months ago
- 80x faster and 95% accurate language identification with Fasttext☆168May 26, 2026Updated 2 weeks ago
- ☆886May 24, 2023Updated 3 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,376Oct 27, 2025Updated 7 months ago
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,535Updated this week
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,808Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- NLP, before and after spaCy☆2,241Sep 22, 2023Updated 2 years ago
- State-of-the-Art Embeddings, Retrieval, and Reranking☆18,805Updated this week
- Language Detection with Infinity-gram☆230Jul 9, 2015Updated 10 years ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆11,899Updated this week
- python parser for human readable dates☆2,814Updated this week
- Fuzzy String Matching in Python☆9,258Feb 24, 2023Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,233Sep 30, 2025Updated 8 months ago
- An open-source NLP research library, built on PyTorch.☆11,892Nov 22, 2022Updated 3 years ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,213Apr 22, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Easy language identification of 380 languages☆17Dec 2, 2019Updated 6 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆15,077May 13, 2026Updated last month
- Module for automatic summarization of text documents and HTML pages.☆3,690Jun 8, 2026Updated last week
- Fixes mojibake and other glitches in Unicode text, after the fact.☆4,043Oct 30, 2024Updated last year
- A fully customisable language detection pipeline for spaCy☆93May 2, 2019Updated 7 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,533Apr 18, 2025Updated last year
- extract text from any document. no muss. no fuss.☆4,614May 7, 2026Updated last month
- Data augmentation for NLP☆4,658Updated this week
- (unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.☆4,257May 19, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Extract Keywords from sentence or Replace keywords in sentences.☆5,715Apr 13, 2025Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,107Mar 19, 2024Updated 2 years ago
- Open source annotation tool for machine learning practitioners.☆10,675Apr 14, 2026Updated 2 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆872Updated this week
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,245Aug 31, 2022Updated 3 years ago
- A library for efficient similarity search and clustering of dense vectors.☆40,249Updated this week
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,406Mar 27, 2026Updated 2 months ago