Mimino666 / langdetectView external linksLinks
Port of Google's language-detection library to Python.
☆1,871Mar 3, 2025Updated 11 months ago
Alternatives and similar repositories for langdetect
Users that are interested in langdetect are comparing it to the libraries listed below
Sorting:
- Stand-alone language identification system☆2,452Jan 1, 2020Updated 6 years ago
- Multilingual text (NLP) processing toolkit☆2,361Nov 10, 2023Updated 2 years ago
- ☆178Mar 28, 2025Updated 10 months ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,630Nov 21, 2025Updated 2 months ago
- Library for fast text representation and classification.☆26,481Mar 22, 2024Updated last year
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,173Nov 27, 2025Updated 2 months ago
- Topic Modelling for Humans☆16,355Nov 1, 2025Updated 3 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Jun 26, 2023Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,355Oct 27, 2025Updated 3 months ago
- State-of-the-Art Text Embeddings☆18,225Updated this week
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,725Updated this week
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,506Updated this week
- Fuzzy String Matching in Python☆9,271Feb 24, 2023Updated 2 years ago
- 80x faster and 95% accurate language identification with Fasttext☆164Jan 23, 2024Updated 2 years ago
- An open-source NLP research library, built on PyTorch.☆11,890Nov 22, 2022Updated 3 years ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆11,627Updated this week
- NLP, before and after spaCy☆2,232Sep 22, 2023Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,143Sep 30, 2025Updated 4 months ago
- python parser for human readable dates☆2,780Updated this week
- Module for automatic summarization of text documents and HTML pages.☆3,659Dec 29, 2025Updated last month
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆14,977Dec 6, 2025Updated 2 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,209Feb 1, 2026Updated last week
- Data augmentation for NLP☆4,645Jun 24, 2024Updated last year
- Extract Keywords from sentence or Replace keywords in sentences.☆5,709Apr 13, 2025Updated 10 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,403Nov 7, 2025Updated 3 months ago
- Open source annotation tool for machine learning practitioners.☆10,534Updated this week
- extract text from any document. no muss. no fuss.☆4,434Feb 4, 2026Updated last week
- A python tool for evaluating the quality of sentence embeddings.☆2,107Mar 19, 2024Updated last year
- Fixes mojibake and other glitches in Unicode text, after the fact.☆4,012Oct 30, 2024Updated last year
- Lightning Fast Language Prediction 🚀☆167Aug 22, 2025Updated 5 months ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,491Jan 14, 2026Updated 3 weeks ago
- A fully customisable language detection pipeline for spaCy☆93May 2, 2019Updated 6 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,397Jan 31, 2026Updated last week
- A Fast, Extensible Progress Bar for Python and CLI☆30,948Feb 4, 2026Updated last week
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,515Apr 18, 2025Updated 9 months ago
- Language Detection with Infinity-gram☆230Jul 9, 2015Updated 10 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆856Jan 23, 2026Updated 3 weeks ago
- A library for efficient similarity search and clustering of dense vectors.☆39,076Updated this week
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,348Dec 22, 2025Updated last month