clips/wordkit

Featurize words into orthographic and phonological vectors.

☆43

Alternatives and similar repositories for wordkit

Users who are interested in wordkit are comparing it to the libraries listed below.

rossellhayes / ipa
🗣️ Convert between phonetic alphabets
☆11 · Feb 7, 2022 · Updated 4 years ago
proycon / spacy2folia
Use spaCy for NLP and output to the FoLiA XML format.
☆12 · Feb 27, 2024 · Updated 2 years ago
cltl / KafNafParserPy
Parser for KAF NAF files written in Python
☆16 · Jul 1, 2021 · Updated 5 years ago
liao961120 / linguisticsdown
Easy Linguistics Document Writing with R Markdown
☆27 · Mar 10, 2019 · Updated 7 years ago
concepticon / concepticon-data
The curation repository for the data behind Concepticon.
☆45 · Updated this week
apertium / lexd
A lexicon compiler for non-suffixational morphologies
☆15 · Jan 29, 2026 · Updated 5 months ago
TakeLab / spacy-udpipe
spaCy + UDPipe
☆167 · May 27, 2026 · Updated last month
cisnlp / GlotCC
[NeurIPS 2024] 🕸 GlotCC Dataset and Pipeline
☆21 · Apr 6, 2025 · Updated last year
Hyperparticle / LemmaTag
A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …
☆34 · Apr 5, 2019 · Updated 7 years ago
datovar4 / Ai_Literature_Review_Suite
☆21 · Mar 27, 2025 · Updated last year
delb-xml / delb-existdb
A Python database interface for eXist-db
☆15 · Jul 1, 2026 · Updated 2 weeks ago
jwieting / simple-and-effective-paraphrastic-similarity
Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations".
☆22 · Oct 3, 2019 · Updated 6 years ago
nlesc-sherlock / spaCy-dutch
Repository for creating models, vocabulary and other necessities for Dutch in spaCy
☆11 · Dec 15, 2016 · Updated 9 years ago
dayeonki / mt_feedback
Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]
☆14 · Apr 3, 2026 · Updated 3 months ago
xigt / xigt
eXtensible Interlinear Glossed Text
☆34 · May 16, 2022 · Updated 4 years ago
julianbrooke / GutenTag
☆32 · Mar 14, 2017 · Updated 9 years ago
facebookresearch / multiloko
A benchmark with locally sourced multilingual questions for 31 languages.
☆18 · May 13, 2026 · Updated 2 months ago
rsprouse / xray_microbeam_database
Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)
☆14 · Oct 8, 2020 · Updated 5 years ago
lukasgarbas / can-we-tune-together
Combining encoder-based language models
☆11 · Nov 11, 2021 · Updated 4 years ago
jasom / pdfparse
Port of Python's pdfminer to Lisp
☆15 · Jan 30, 2016 · Updated 10 years ago
newsreader / NAF
Specification of NAF, the NLP annotation format
☆21 · Jan 19, 2021 · Updated 5 years ago
cldf-clts / clts
Cross-Linguistic Transcription Systems
☆17 · Mar 20, 2026 · Updated 4 months ago
dill / bayes-gam-explainer
Code and data to reproduce the examples in the paper Bayesian views of generalized additive modelling.
☆18 · Jan 6, 2025 · Updated last year
vasilescur / parse_context
Use GPT-3 to process human conversations and extract context, identify information that would be useful, and suggest data sources to get …
☆30 · Dec 21, 2021 · Updated 4 years ago
dpalmasan / TRUNAJOD2.0
An easy-to-use library to extract indices from texts.
☆30 · Sep 7, 2021 · Updated 4 years ago
mbykov / cholok
Phonetic transcription for Tibetan
☆10 · Mar 13, 2019 · Updated 7 years ago
vineetdhanawat / twitter-sentiment-analysis
Twitter Sentiment Analysis - BITS Pilani
☆12 · Mar 27, 2014 · Updated 12 years ago
lingpy / lingpy
LingPy: Python library for quantitative tasks in historical linguistics
☆144 · May 27, 2026 · Updated last month
pyconll / pyconll
A minimal, pure Python library to interface with CoNLL-U format files.
☆155 · Jul 6, 2026 · Updated 2 weeks ago
Helsinki-NLP / OPUS-MT-testsets
Benchmarks for evaluating MT models
☆11 · Jun 26, 2024 · Updated 2 years ago
PhonologicalCorpusTools / CorpusTools
Phonological CorpusTools
☆122 · May 24, 2025 · Updated last year
CoEDL / elan-helpers
Tools and scripts for working with ELAN
☆10 · Aug 4, 2022 · Updated 3 years ago
vincentarelbundock / violets
Violets are BLUE. OLS is too. (R package)
☆14 · Aug 11, 2023 · Updated 2 years ago
thackl / treemmer-animate
☆14 · Mar 28, 2018 · Updated 8 years ago
BayBenj / english-syllabifier
Tool for parsing English phonemes into syllables.
☆10 · Jan 15, 2018 · Updated 8 years ago
mnavascues / ABCRFtutorial
Crash Course on Approximate Bayesian Computation in Population Genetics
☆11 · Sep 19, 2022 · Updated 3 years ago
Zhang-Yihao / Transfomer2DFA
Implementation for the paper "Automata Extraction from Transformers".
☆12 · Jun 8, 2024 · Updated 2 years ago
daemanos / pangloss
Interlinear glosses for pandoc
☆10 · Feb 12, 2018 · Updated 8 years ago
vasishth / LM
☆17 · Jul 22, 2020 · Updated 5 years ago