remusao / wgraph
Etymological graphs based on Wiktionary dumps
☆18 · Updated last month
Alternatives and similar repositories for wgraph:
Users who are interested in wgraph are comparing it to the libraries listed below:
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/ ☆51 · Updated 5 years ago
- Offline etymological dictionary based on Wiktionary data ☆21 · Updated 2 years ago
- Helsinki Finite-State Technology (library and application suite) ☆128 · Updated 3 weeks ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 · Updated 3 years ago
- A library for fetching and reading Tatoeba's weekly exports ☆22 · Updated last year
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1) ☆24 · Updated 3 years ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆29 · Updated 4 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 · Updated last year
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty… ☆89 · Updated 9 months ago
- A Python Wiktionary Parser ☆357 · Updated last year
- Morphological Dictionaries for German Language ☆28 · Updated 6 years ago
- Open Language Profiles - English profile datasets from CEFR-J ☆116 · Updated 4 years ago
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word. ☆66 · Updated 3 years ago
- A Python module to discover the etymology of words ☆149 · Updated 9 months ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆48 · Updated last year
- Automatically exported from code.google.com/p/foma ☆122 · Updated 7 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆61 · Updated last month
- Interactive visualization of Wiktionary words and etymologies. ☆91 · Updated this week
- eXtensible Interlinear Glossed Text ☆32 · Updated 2 years ago
- Wiktionary parser tool for many language editions. ☆54 · Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆151 · Updated 3 months ago
- A list of vocabulary lists ☆21 · Updated 4 years ago
- Data for the International Phonetic Alphabet (IPA) ☆27 · Updated 2 years ago
- The curation repository for the data behind Concepticon. ☆37 · Updated this week
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text. ☆23 · Updated 4 years ago
- A language evolution simulator, using realistic phonetic changes. ☆38 · Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆97 · Updated this week
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars ☆35 · Updated 4 months ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech… ☆71 · Updated 2 months ago
- German Morphological Analyzer ☆47 · Updated 3 years ago