English word segmentation, written in pure Python and based on a trillion-word corpus.
☆377 · Dec 26, 2022 · Updated 3 years ago
Alternatives and similar repositories for python-wordsegment
Users that are interested in python-wordsegment are comparing it to the libraries listed below.
- Python module for computing statistics and regression in a single pass. ☆101 · Jul 13, 2021 · Updated 4 years ago
- Python pattern matching like functional languages. ☆161 · Feb 14, 2021 · Updated 5 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies. ☆870 · Feb 19, 2023 · Updated 3 years ago
- Code for my blog post on Generating Words from Embeddings ☆23 · Jul 25, 2024 · Updated last year
- Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate", etc. ☆631 · Jun 24, 2021 · Updated 4 years ago
- Building and Using A Seed Corpus for the Human Language Project ☆11 · Feb 9, 2018 · Updated 8 years ago
- Python Sorted Collections Library ☆111 · Nov 28, 2022 · Updated 3 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more. ☆98 · Oct 1, 2020 · Updated 5 years ago
- Easy language identification of 380 languages ☆17 · Dec 2, 2019 · Updated 6 years ago
- Thoughts toward and tutorial on corpus-driven narrative generation ☆25 · Nov 5, 2020 · Updated 5 years ago
- Spell correct entire sentences using nltk freqdist and symspell ☆19 · Jul 3, 2017 · Updated 8 years ago
- Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set ☆3,943 · Mar 8, 2024 · Updated 2 years ago
- The Non-Official Characterization (NOC) List is a knowledge-base containing semantic triples about famous people, living and dead, fictio… ☆24 · Jan 9, 2019 · Updated 7 years ago
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size. ☆932 · Nov 20, 2022 · Updated 3 years ago
- Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python. ☆2,878 · Aug 10, 2024 · Updated last year
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆675 · Jun 2, 2025 · Updated 11 months ago
- NLP, before and after spaCy ☆2,242 · Sep 22, 2023 · Updated 2 years ago
- Text pattern search using marisa-trie ☆19 · Jan 26, 2025 · Updated last year
- Configure an LDAPS Endpoint for Simple AD ☆14 · Aug 29, 2017 · Updated 8 years ago
- Port of Google's language-detection library to Python. ☆1,885 · Mar 3, 2025 · Updated last year
- Python search module for fast approximate string matching ☆54 · Jan 25, 2023 · Updated 3 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆205 · Oct 6, 2020 · Updated 5 years ago
- Extract Keywords from sentence or Replace keywords in sentences. ☆5,711 · Apr 13, 2025 · Updated last year
- Lists raids and the like ☆16 · Nov 2, 2022 · Updated 3 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Feb 27, 2024 · Updated 2 years ago
- Sketching algorithms implemented in Chapel and Python ☆10 · Jun 8, 2017 · Updated 8 years ago
- Spoken Language Translation System ☆14 · Jun 25, 2019 · Updated 6 years ago
- Use word vectors to interactively generate lists of similar words ☆112 · Jan 10, 2018 · Updated 8 years ago
- A library for Multilingual Unsupervised or Supervised word Embeddings ☆3,246 · Aug 31, 2022 · Updated 3 years ago
- SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm ☆3,414 · Apr 21, 2026 · Updated last month
- Library for fast text representation and classification. ☆26,524 · Mar 22, 2024 · Updated 2 years ago
- A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912 ☆24 · Jul 8, 2020 · Updated 5 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,529 · Apr 18, 2025 · Updated last year
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,595 · May 19, 2026 · Updated last week
- bin files ☆13 · Jan 30, 2025 · Updated last year
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ☆81 · Apr 23, 2016 · Updated 10 years ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries ☆2,890 · Mar 27, 2026 · Updated last month
- Rule-based pronunciation for English ☆24 · Sep 21, 2018 · Updated 7 years ago
- Implementation of Trimmed Grassmann Average (TGA) by Hauberg S et al. in Python ☆23 · Oct 10, 2015 · Updated 10 years ago