amsqr / Spanish-MetaphoneLinks
Metaphone is a phonetic algorithm, an algorithm published in 1990 for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and …
☆35Updated 11 years ago
Alternatives and similar repositories for Spanish-Metaphone
Users that are interested in Spanish-Metaphone are comparing it to the libraries listed below
Sorting:
- An implementation of latent Dirichlet allocation in javascript☆185Updated 2 years ago
- The curation repository for the data behind Concepticon.☆39Updated last week
- command-line tool to extract taxonomies from Wikidata☆128Updated 6 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆43Updated last year
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing maps☆22Updated 3 years ago
- A lightweight end-to-end NLP and visualization platform to make WordStream.☆43Updated 2 years ago
- stoplists for African languages generated from the ASP corpus☆14Updated 9 years ago
- MorphoDiTa: Morphologic Dictionary and Tagger☆73Updated last year
- varied english texts for modern NLP testing☆75Updated 3 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- https://www.kaylinpavlik.com/50-years-of-pop-music/☆111Updated 7 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated last week
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated last month
- ☆72Updated 6 months ago
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆69Updated 3 weeks ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- NLTK Contrib☆166Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆254Updated 4 years ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- A network clustering library for javascript☆34Updated 2 months ago
- PurePos is an open source hybrid morphological tagger.☆16Updated 4 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆49Updated 2 weeks ago
- Import GeoNames.org data into a SQLite database for full-text search and autocomplete☆35Updated 6 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- A geostatistical based approach to doing Toponym Resolution☆19Updated 8 years ago
- Scrapes some Finnish word definitions from English Wiktionary.☆8Updated last year
- Universal Dependencies online documentation☆288Updated this week