droher / etymology-db
An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
☆93Updated 11 months ago
Alternatives and similar repositories for etymology-db:
Users that are interested in etymology-db are comparing it to the libraries listed below
- Interactive visualization of Wiktionary words and etymologies.☆92Updated 2 months ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- eXtensible Interlinear Glossed Text☆33Updated 2 years ago
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/☆51Updated 5 years ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 2 years ago
- A program for automated scansion of verse.☆19Updated 10 years ago
- A Python module to discover the etymology of words☆150Updated last year
- A cloud-based, open-source system for writing and publishing dictionaries.☆90Updated last year
- Latin BERT☆60Updated 10 months ago
- Etymological graphs based on Wiktionary dumps☆21Updated 2 months ago
- LGPSI: An open, expansive Greek-reading composition project☆143Updated 4 months ago
- The curation repository for the data behind Concepticon.☆38Updated last week
- tool for collectively summarizing large discussions☆143Updated 2 years ago
- Perseus Treebank Data☆72Updated 10 months ago
- ☆92Updated last week
- English Resource Grammar☆21Updated 9 months ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- ZeuScansion is a fst-based system capable of performing metrical scansion of poetry written in English.☆38Updated 2 years ago
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.☆21Updated 6 years ago
- A language evolution simulator, using realistic phonetic changes.☆38Updated 2 years ago
- Ancient Greek language models for spaCy☆29Updated last month
- The Global WordNet Association Collaborative Inter-Lingual Index☆42Updated 6 months ago
- Poetic processing, for Python.☆40Updated last year
- The World Atlas Of Language Structures Online☆128Updated 3 months ago
- Collaborative data curation for Glottolog☆160Updated last week
- Official releases of the PROIEL treebank of ancient Indo-European languages☆36Updated 2 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆48Updated last year
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆37Updated 6 months ago
- A tool for analyzing the word histories of a text.☆34Updated 5 months ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆222Updated 2 years ago