amsqr / Spanish-Metaphone
Metaphone is a phonetic algorithm, an algorithm published in 1990 for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and …
☆35Updated 11 years ago
Alternatives and similar repositories for Spanish-Metaphone
Users that are interested in Spanish-Metaphone are comparing it to the libraries listed below
Sorting:
- varied english texts for modern NLP testing☆75Updated 2 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- A network clustering library for javascript☆34Updated 3 weeks ago
- an opinionated assembly of wordnet for javascript☆56Updated 8 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- A lightweight end-to-end NLP and visualization platform to make WordStream.☆43Updated 2 years ago
- command-line tool to extract taxonomies from Wikidata☆125Updated 5 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆47Updated 2 years ago
- A trend viewer written in Python/JavaScript☆21Updated 6 months ago
- Specification of NAF, the NLP annotation format☆21Updated 4 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- List of (possible) English hedge words☆46Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated 3 months ago
- The RICardo dataset compiles trade statistics sources of international trade bilateral flows of the 19th century.☆18Updated this week
- An offline/online field database which adapts to its user's terminology and I-Language. http://fielddb.github.io☆79Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Spanish data from the AnCora corpus.☆30Updated last week
- Events and Situations Ontology☆14Updated 7 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆31Updated last year
- Data Store for Annotation Studio☆46Updated 2 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- NLTK Contrib☆166Updated last year
- An implementation of latent Dirichlet allocation in javascript☆184Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- linguistics backend☆41Updated 2 years ago
- node.js interface to the ConceptNet semantic network API [DEPRECATED; ConceptNet API has changed]☆30Updated 7 years ago
- Multi Tier Annotation Search☆12Updated last year
- CLI tool for importing entities from Wikidata / Wikibase☆23Updated 2 years ago