kent37 / guess-languageLinks
Automatically exported from code.google.com/p/guess-language
☆54Updated 2 months ago
Alternatives and similar repositories for guess-language
Users that are interested in guess-language are comparing it to the libraries listed below
Sorting:
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 5 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆38Updated 11 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- ☆52Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 3 months ago
- Memory-based shallow parser for Python☆74Updated 6 years ago
- Utility library to turn country names into ISO two-letter codes☆71Updated 4 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- stav text annotation visualiser☆34Updated 14 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Updated last year
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Python port for IWNLP.Lemmatizer☆18Updated 2 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆191Updated 3 years ago
- Extract dates from text☆66Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated 3 weeks ago
- Retrieve quotes from any Wikiquote article☆117Updated 8 months ago
- Find rss, atom, xml, and rdf feeds on webpages☆31Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆57Updated 4 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Updated 2 years ago
- A Python library for extracting semantic information from text, such as dates and numbers.☆79Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆171Updated 4 years ago
- A Generalized Suffix Tree for any Python iterable using Ukkonen's algorithm, with Lowest Common Ancestor retrieval.☆54Updated 2 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 6 years ago
- Python library for extracting text from various file formats (for indexing).☆113Updated 3 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 9 years ago
- Text readability metrics in Python.☆11Updated 12 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Detect and visualize text reuse☆118Updated last year
- Scalable String Similarity Joins in Python☆39Updated last year