libindic / soundex
Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all indian languages and English. Provides intra-indic string comparison
☆56Updated 5 years ago
Alternatives and similar repositories for soundex:
Users that are interested in soundex are comparing it to the libraries listed below
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- A web application tagging and retrieval of arguments in text☆29Updated last year
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Python library for converting UTF to WX and vice-versa for Indian languages.☆48Updated 2 years ago
- Transliteration module for Indian Languages☆77Updated last year
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- spaCy-to-naf converter☆21Updated 7 months ago
- WordNet Domains, WordNet Affect and SentiWords☆49Updated 9 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆47Updated last month
- LNEx: Location Name Extractor☆24Updated 4 years ago
- Generating Wikipedia article embeddings using Word2vec and reading sessions☆18Updated 7 years ago
- Create a Geonames gazetteer index in Elasticsearch☆74Updated last year
- Calculate readability scores☆40Updated 5 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- Natural Language Generation for Gramex applications.☆24Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Wikidata embedding☆51Updated 2 months ago
- IXA pipes Named Entity Tagger (http://ixa2.si.ehu.es/ixa-pipes).☆32Updated 5 years ago