droher / etymology-db
An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
☆79 · Updated 6 months ago
Related projects
Alternatives and complementary repositories for etymology-db
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/ ☆50 · Updated 5 years ago
- Interactive visualization of Wiktionary words and etymologies. ☆90 · Updated this week
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1) ☆21 · Updated 2 years ago
- Collaborative data curation for Glottolog ☆152 · Updated this week
- A cloud-based, open-source system for writing and publishing dictionaries. ☆86 · Updated 10 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars ☆34 · Updated last month
- eXtensible Interlinear Glossed Text ☆31 · Updated 2 years ago
- A web framework to display Cross Linguistic Linked Data. ☆54 · Updated last month
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆97 · Updated 6 years ago
- Helsinki Finite-State Technology (library and application suite) ☆124 · Updated this week
- A list of vocabulary lists ☆21 · Updated 4 years ago
- Dynamic JavaScript version of phpSyntaxTree - a tool to draw syntax trees from labelled bracket notation. ☆82 · Updated 8 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆94 · Updated this week
- This packages up data for the Open Multilingual Wordnet ☆43 · Updated 3 weeks ago
- Automatically exported from code.google.com/p/foma ☆117 · Updated 4 months ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text. ☆22 · Updated 4 years ago
- A Python module to discover the etymology of words ☆145 · Updated 6 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 · Updated last year
- Grammatical Framework's Resource Grammar Library (RGL) ☆52 · Updated last week
- The Unicode Cookbook for Linguists ☆53 · Updated 4 years ago
- Grammatical Framework core: compiler, shell & runtimes ☆131 · Updated 3 weeks ago
- Imports Wiktionary's grammatical data into Wikidata ☆17 · Updated 4 years ago
- Latin BERT ☆57 · Updated 4 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases ☆32 · Updated this week
- University of Colorado VerbNet ☆101 · Updated 6 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates ☆43 · Updated last year
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… ☆11 · Updated 11 months ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆211 · Updated last year
- The World Atlas of Language Structures ☆55 · Updated last month
- Wikidata lexemes presentations ☆24 · Updated last week