remusao / wgraphLinks
Etymological graphs based on Wiktionary dumps
☆23Updated 10 months ago
Alternatives and similar repositories for wgraph
Users that are interested in wgraph are comparing it to the libraries listed below
Sorting:
- Interactive visualization of Wiktionary words and etymologies.☆95Updated last week
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/☆55Updated 6 years ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆291Updated 9 months ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆143Updated last year
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆27Updated 5 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- Helsinki Finite-State Technology (library and application suite)☆136Updated this week
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆35Updated last year
- A Python Wiktionary Parser☆369Updated 5 months ago
- Automatically exported from code.google.com/p/foma☆126Updated 4 months ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆49Updated 2 years ago
- Offline etymological dictionary based on Wiktionary data☆23Updated 3 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆57Updated 4 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆53Updated 2 years ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆162Updated last year
- Machine-readable Wiktionary☆77Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆62Updated 10 years ago
- Monolingual wordlists with pronunciation information in IPA☆707Updated 7 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆66Updated 2 weeks ago
- Extract screenshots & audio clips from YouTube videos into Anki cards☆69Updated 4 years ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆54Updated 11 months ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆74Updated last year
- A cloud-based, open-source system for writing and publishing dictionaries.☆98Updated 2 years ago
- A list of vocabulary lists☆22Updated 5 years ago
- Spaced repetition for memorizing tons of things.☆165Updated 10 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated last month
- Machine-readable lists of lemma-token pairs in 23 languages.☆354Updated 3 years ago