pmandera / semspaces
Semantic spaces in Python
☆14, updated last year
Related projects
Alternatives and complementary repositories for semspaces
- English Small World of Words SWOWEN-2018 (☆66, updated 2 years ago)
- Featurize words into orthographic and phonological vectors. (☆40, updated last year)
- A psycholinguistic modeling toolkit (☆24, updated this week)
- Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo… (☆40, updated this week)
- Automatically exported from code.google.com/p/incremental-top-down-parser (☆13, updated 9 years ago)
- Easy black-box access to state-of-the-art language models (☆14, updated last year)
- Bayesian pragmatic models implemented in Python (☆19, updated 8 years ago)
- A list of publicly available data sets from psycholinguistic studies (☆31, updated 8 years ago)
- ADS Project (☆14, updated 8 years ago)
- A Large Automatically-Constructed Resource of Predicate Paraphrases (☆42, updated 4 years ago)
- Code for morphological transformations (☆29, updated 7 years ago)
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX). (☆50, updated last year)
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. (☆21, updated 7 years ago)
- Repository for the CLiPS HAte speech DEtection System [HADES]. (☆24, updated 6 years ago)
- MiTextExplorer - interactive browser of text and document covariates. (☆24, updated 9 years ago)
- A Combinatory Categorial Grammar library. (☆22, updated 10 years ago)
- New York Times Word Innovation Types dataset (☆21, updated 3 years ago)
- Discontinuous Data-Oriented Parsing (☆46, updated 10 months ago)
- Corpus of naturalistic stories with annotation and psycholinguistic measures (☆50, updated 3 years ago)
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… (☆60, updated 5 months ago)
- Multilingual Language Modeling Toolkit (☆11, updated 7 years ago)
- A web-based, token-level annotation tool for non-standard language data (☆10, updated 4 years ago)
- Text readability metrics in Python. (☆12, updated 11 years ago)
- Matrix tools for building and inspecting latent spaces (☆27, updated 6 years ago)
- Code for learning geographically-informed word embeddings (☆22, updated 2 years ago)
- Tools for training and evaluating word embeddings based on subtitles. Published as "subs2vec: Word embeddings from subtitles in 55 langua… (☆33, updated 4 years ago)
- Simple CORPORA list crawler (☆10, updated 7 years ago)
- Basic dataset for the linguistic data collection. (☆15, updated 7 years ago)
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests (☆41, updated 2 years ago)
- This repository contains the Framester resource, the main outcome of the framester project. (☆34, updated 4 years ago)