droher / etymology-db
An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
☆141 · Updated last year
Alternatives and similar repositories for etymology-db
Users who are interested in etymology-db are comparing it to the repositories listed below.
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/ ☆55 · Updated 6 years ago
- Interactive visualization of Wiktionary words and etymologies. ☆94 · Updated 4 months ago
- LGPSI: An open, expansive Greek-reading composition project ☆154 · Updated 2 months ago
- The Open English WordNet ☆682 · Updated last week
- Making the public domain Loebs more easily downloadable. Data at https://github.com/ryanfb/loebolus-data ☆101 · Updated 2 weeks ago
- A cloud-based, open-source system for writing and publishing dictionaries. ☆98 · Updated last year
- A Python module to discover the etymology of words ☆151 · Updated last year
- A language evolution simulator, using realistic phonetic changes. ☆39 · Updated 2 years ago
- Etymological graphs based on Wiktionary dumps ☆23 · Updated 9 months ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text. ☆27 · Updated 5 years ago
- The World Atlas of Language Structures ☆72 · Updated last year
- A Python Wiktionary Parser ☆368 · Updated 5 months ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion ☆78 · Updated 8 months ago
- ☆114 · Updated last week
- eXtensible Interlinear Glossed Text ☆33 · Updated 3 years ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat… ☆161 · Updated last year
- A fluid medium for storing, relating, and surfacing thoughts. ☆135 · Updated 3 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1) ☆25 · Updated 3 years ago
- Analyse rhyme scheme, metre and form of poems ☆132 · Updated 4 years ago
- SegBo: A database of borrowed sounds in the world’s languages ☆16 · Updated last year
- FieldWorks is a suite of software tools for language and cultural data, with support for complex scripts. ☆102 · Updated this week
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆57 · Updated 4 years ago
- browse wikipedia a la andy matuschak's evergreen notes ☆29 · Updated last year
- poetry from dirty ocr ☆62 · Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆108 · Updated last month
- Perseus Treebank Data ☆76 · Updated last year
- ☆107 · Updated last year
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆224 · Updated 2 years ago
- CLDF: Cross-Linguistic Data Formats - the specification ☆62 · Updated 4 months ago
- 📜 A CLI toolkit for extracting and working with your digital history ☆184 · Updated last year