kbatsuren/CogNet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kbatsuren/CogNet)

kbatsuren / CogNet

CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

☆56

Alternatives and similar repositories for CogNet

Users that are interested in CogNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kbatsuren / wiktra
View on GitHub
Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)
☆37Jun 29, 2025Updated last year
apertium / lexd
View on GitHub
A lexicon compiler for non-suffixational morphologies
☆15Jan 29, 2026Updated 6 months ago
loanwordbank / loanpy
View on GitHub
LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …
☆16Updated this week
lingpy / lingrex
View on GitHub
Linguistic Reconstruction with LingPy
☆16Aug 5, 2024Updated last year
clefourrier / EtymDB
View on GitHub
[LREC 2020] EtymDB, an Etymological DataBase (v2.1)
☆28Jan 4, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liao961120 / linguisticsdown
View on GitHub
Easy Linguistics Document Writing with R Markdown
☆27Mar 10, 2019Updated 7 years ago
lingpy / lingpy
View on GitHub
LingPy: Python library for quantitative tasks in historical linguistics
☆145May 27, 2026Updated 2 months ago
byungdoh / llm_surprisal
View on GitHub
Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Am…
☆23Mar 7, 2024Updated 2 years ago
segbo-db / segbo
View on GitHub
SegBo: A database of borrowed sounds in the world’s languages
☆16Mar 20, 2024Updated 2 years ago
acoli-repo / acoli-dicts
View on GitHub
3000+ machine-readable open source dictionaries distributed by the Applied Computational Linguistics lab at the University of Augsburg, G…
☆17Jul 19, 2023Updated 3 years ago
DuyguA / DEMorphy
View on GitHub
German Morphological Analyzer
☆54Nov 12, 2021Updated 4 years ago
cultural-csk / candle
View on GitHub
Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)
☆11Feb 15, 2024Updated 2 years ago
wenkokke / dep2con
View on GitHub
several algorithms for converting dependency structures into constituency structures.
☆10Feb 7, 2022Updated 4 years ago
sigmorphon / 2022SegmentationST
View on GitHub
SIGMORPHON 2022 Shared Task on Morpheme Segmentation
☆36Mar 26, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
concepticon / norare-data
View on GitHub
Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts
☆16Updated this week
blcuicall / TR-Reading-List
View on GitHub
A text readability reading list maintained by BLCU ICALL Research Group
☆13Mar 27, 2020Updated 6 years ago
dginev / ar5iv-css
View on GitHub
Some CSS experiments for arXiv HTML documents converted via latexml
☆20Jul 5, 2026Updated 3 weeks ago
valentinhofmann / superbizarre
View on GitHub
Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"
☆18Aug 17, 2021Updated 4 years ago
babylm / baseline-pretraining
View on GitHub
Code for pre-training BabyLM baseline models.
☆16Jun 19, 2023Updated 3 years ago
MITLibraries / oclc-api-python-scripts
View on GitHub
Python scripts for retrieving data from the OCLC APIs
☆12Jul 25, 2023Updated 3 years ago
rnd2110 / MorphAGram
View on GitHub
A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars
☆17Jun 14, 2024Updated 2 years ago
matbahasa / MALINDO_Morph
View on GitHub
Kamus morfologi untuk bahasa Melayu/Indonesia
☆17Nov 23, 2024Updated last year
lexibank / lexibank-analysed
View on GitHub
Study on lexibank data (presenting the lexibank dataset).
☆16Jun 16, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
maafiah / VXGL
View on GitHub
☆16Mar 2, 2024Updated 2 years ago
jspsych / webbook
View on GitHub
A web-based textbook for jsPsych
☆12Oct 19, 2021Updated 4 years ago
atsushieno / aap-lv2
View on GitHub
AAP LV2 support: wrapper, the foundation for LV2 plugin ports to Android. See also aap-core Wiki for the list of ports.
☆11Jul 13, 2026Updated 2 weeks ago
isi-nlp / carmel
View on GitHub
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆15Jan 24, 2017Updated 9 years ago
omwn / omw-data
View on GitHub
This packages up data for the Open Multilingual Wordnet
☆69Mar 28, 2026Updated 4 months ago
sigmorphon / 2021Task0
View on GitHub
☆19Oct 14, 2021Updated 4 years ago
Princeton-CDH / geniza
View on GitHub
version 4.x of the Princeton Geniza Project
☆13Jul 9, 2026Updated 2 weeks ago
idiap / wmil-sgd
View on GitHub
Weighted multiple-instance learning algorithm based on stochastic gradient descent
☆12Feb 22, 2019Updated 7 years ago
JackEdTaylor / LexOPS
View on GitHub
An R Package and Shiny App for generating matched stimuli for factiorial-design experiments.
☆29Jan 16, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
bnicenboim / bcogsci
View on GitHub
Datasets and models included in the book "Introduction to Bayesian Data Analysis for Cognitive Science".
☆17Apr 21, 2026Updated 3 months ago
thibo73800 / pytorch_nlp
View on GitHub
Introduction to pytorch and NLP
☆14Sep 22, 2019Updated 6 years ago
nchibana / moviearcs
View on GitHub
A tool that visualizes emotional arcs of movie scripts
☆18Dec 8, 2022Updated 3 years ago
bayartsogt-ya / albert-mongolian
View on GitHub
ALBERT trained on Mongolian text corpus
☆19Jan 10, 2021Updated 5 years ago
dzieciou / lemmatizer-pl
View on GitHub
Python lemmatizer for Polish.
☆19Sep 25, 2019Updated 6 years ago
vineetdhanawat / twitter-sentiment-analysis
View on GitHub
Twitter Sentiment Analysis - BITS Pilani
☆12Mar 27, 2014Updated 12 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago