Featurize words into orthographic and phonological vectors.
☆41 · May 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for wordkit
Users who are interested in wordkit compare it with the libraries listed below.
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Feb 27, 2024 · Updated 2 years ago
- Parser for KAF NAF files written in Python ☆16 · Jul 1, 2021 · Updated 4 years ago
- speakr: A Wrapper for the Phonetic Software Praat ☆26 · Mar 7, 2025 · Updated 11 months ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot ☆13 · Dec 18, 2020 · Updated 5 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- Easy Linguistics Document Writing with R Markdown ☆27 · Mar 10, 2019 · Updated 6 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆13 · Oct 8, 2020 · Updated 5 years ago
- tools for phoneticians and phonologists ☆32 · Dec 5, 2018 · Updated 7 years ago
- Unsupervised concept extraction from clinical text ☆14 · Jun 17, 2024 · Updated last year
- A lexicon compiler for non-suffixational morphologies ☆13 · Jan 29, 2026 · Updated last month
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, … ☆34 · Apr 5, 2019 · Updated 6 years ago
- Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy… ☆12 · Aug 15, 2023 · Updated 2 years ago
- The curation repository for the data behind Concepticon. ☆42 · Feb 19, 2026 · Updated last week
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024 ☆20 · Apr 6, 2025 · Updated 10 months ago
- An R package for easy and flexible Bayesian Measurement Modeling ☆17 · Updated this week
- Bayes Factors for brms Models ☆14 · May 26, 2022 · Updated 3 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 · Dec 18, 2022 · Updated 3 years ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 · Sep 7, 2022 · Updated 3 years ago
- ggdmc provides tools to conduct Bayesian inference on a range of choice response time models. ☆19 · Oct 29, 2025 · Updated 4 months ago
- Model implementation for the contextual embeddings project ☆40 · Jun 2, 2025 · Updated 8 months ago
- Personal resources for my PhD, focusing on Bayesian inference and different programming languages ☆33 · Mar 20, 2021 · Updated 4 years ago
- Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations". ☆22 · Oct 3, 2019 · Updated 6 years ago
- VOT manipulation ☆20 · Feb 19, 2023 · Updated 3 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API ☆25 · Dec 2, 2017 · Updated 8 years ago
- We constructed an EEG dataset based on imagined speech and performed semantic decoding on it. ☆33 · Dec 13, 2024 · Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies ☆29 · Nov 18, 2025 · Updated 3 months ago
- Text processing library for sentiment analysis and related tasks ☆27 · Oct 25, 2018 · Updated 7 years ago
- R package for dealing with Eprime txt files ☆25 · Apr 25, 2025 · Updated 10 months ago
- Ubiflux Vigor ventilation system RS485 Modbus communications with Python ☆11 · Feb 20, 2026 · Updated last week
- A web application tagging and retrieval of arguments in text ☆30 · May 1, 2023 · Updated 2 years ago
- Alpino parser and related tools for Dutch ☆27 · Updated this week
- rPraat package for R ☆30 · Dec 9, 2021 · Updated 4 years ago
- Lecture notes from a statistics course I teach at Potsdam ☆29 · Sep 9, 2020 · Updated 5 years ago
- Make Praat Picture style plots of acoustic data ☆37 · Feb 4, 2026 · Updated 3 weeks ago
- TyDiP Multilingual Politeness dataset and code ☆12 · Oct 15, 2023 · Updated 2 years ago
- R package for bridge sampling ☆34 · Nov 18, 2025 · Updated 3 months ago
- Rank-normalization, folding, and localization: An improved R-hat for assessing convergence of MCMC ☆32 · Nov 10, 2021 · Updated 4 years ago