English word segmentation, written in pure-Python, and based on a trillion-word corpus.
☆377Dec 26, 2022Updated 3 years ago
Alternatives and similar repositories for python-wordsegment
Users that are interested in python-wordsegment are comparing it to the libraries listed below
Sorting:
- Code for my blog post on Generating Words from Embeddings☆23Jul 25, 2024Updated last year
- Python code and data for the post "Word Segmentation, or Makingsenseofthis"☆17Oct 24, 2022Updated 3 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆633Jun 24, 2021Updated 4 years ago
- Building and Using A Seed Corpus for the Human Language Project☆11Feb 9, 2018Updated 8 years ago
- Python pattern matching like functional languages.☆161Feb 14, 2021Updated 5 years ago
- NLP, before and after spaCy☆2,235Sep 22, 2023Updated 2 years ago
- Text pattern search using marisa-trie☆18Jan 26, 2025Updated last year
- Use word vectors to interactively generate lists of similar words☆112Jan 10, 2018Updated 8 years ago
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.☆934Nov 20, 2022Updated 3 years ago
- Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set☆3,929Mar 8, 2024Updated last year
- The Non-Official Characterization (NOC) List is a knowledge-base containing semantic triples about famous people, living and dead, fictio…☆24Jan 9, 2019Updated 7 years ago
- Python search module for fast approximate string matching☆54Jan 25, 2023Updated 3 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆857Feb 13, 2026Updated 2 weeks ago
- Neural network poetry rewriter☆21Feb 4, 2022Updated 4 years ago
- Rule-based pronunciation for English☆24Sep 21, 2018Updated 7 years ago
- Simple method used to load configuration variables from different sources.☆10Jun 20, 2018Updated 7 years ago
- FTRL-Proximal Online Learning Algorithm☆15May 22, 2017Updated 8 years ago
- Extract Keywords from sentence or Replace keywords in sentences.☆5,708Apr 13, 2025Updated 10 months ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆52Jul 19, 2013Updated 12 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆676Jun 2, 2025Updated 9 months ago
- Generative poetry from a recurrent neural network filtered by emotional and external influences.☆25May 15, 2016Updated 9 years ago
- Implementation of Trimmed Grassmann Average (TGA) by Hauberg S et al. in Python☆23Oct 10, 2015Updated 10 years ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,891Feb 9, 2026Updated 3 weeks ago
- ☆12Sep 15, 2025Updated 5 months ago
- A tiny script to convert your mdx dictionary file to CSV☆11Dec 22, 2018Updated 7 years ago
- Spellchecker service based on hunspell for 90 languages☆10Oct 26, 2020Updated 5 years ago
- Fast Vector Operations on Pretty Big Data☆13Nov 17, 2015Updated 10 years ago
- Configure an LDAPS Endpoint for Simple AD☆14Aug 29, 2017Updated 8 years ago
- Reflection metadata support for classes and functions with flowtype type aliases support☆10Nov 16, 2017Updated 8 years ago
- 🔧 SQL for csv file in UNIX command line with awk.☆16Aug 6, 2022Updated 3 years ago
- Genrates python dependency graph☆22Aug 10, 2018Updated 7 years ago
- ☆10Dec 11, 2016Updated 9 years ago
- CoreML + S4TF Transfer Learning with Embedding and Multi Input☆12Mar 21, 2020Updated 5 years ago
- The Average Novel☆10Dec 2, 2017Updated 8 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,517Apr 18, 2025Updated 10 months ago
- Fixes mojibake and other glitches in Unicode text, after the fact.☆4,014Oct 30, 2024Updated last year
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,254Nov 27, 2025Updated 3 months ago
- Library for fast text representation and classification.☆26,502Mar 22, 2024Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆201Oct 6, 2020Updated 5 years ago