remusao / wgraphLinks
Etymological graphs based on Wiktionary dumps
☆21Updated 5 months ago
Alternatives and similar repositories for wgraph
Users that are interested in wgraph are comparing it to the libraries listed below
Sorting:
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/☆53Updated 5 years ago
- Interactive visualization of Wiktionary words and etymologies.☆93Updated last month
- Helsinki Finite-State Technology (library and application suite)☆133Updated 2 months ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- Audiobook alignment for Indigenous languages☆40Updated 2 weeks ago
- British English pronunciation dictionary☆95Updated 7 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆105Updated last week
- Converts English text to IPA notation☆389Updated 2 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆82Updated last year
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆49Updated last year
- A Python Wiktionary Parser☆362Updated 2 weeks ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆156Updated 7 months ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆128Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆53Updated 4 years ago
- Rewrite of pyetymology in js. Extracts etymological information from Wiktionary and displays it in a graph.☆16Updated 2 years ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆284Updated 4 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆64Updated last week
- Jason Riggle's chart of phonological features in JSON format + extras☆54Updated last year
- Offline etymological dictionary based on Wiktionary data☆21Updated 3 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆32Updated 10 months ago
- Python 3 library for accenting (and analyzing the accentuation of) Ancient Greek words☆57Updated 3 years ago
- A Python module to discover the etymology of words☆150Updated last year
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 8 months ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆38Updated 6 months ago
- Data for the International Phonetic Alphabet (IPA)☆31Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.☆220Updated last year
- Automatically exported from code.google.com/p/foma☆122Updated 5 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆95Updated 2 years ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆24Updated 4 years ago
- CMUdict maintenance, and tools☆228Updated 7 months ago