jvparidon / subs2vec
Tools for training and evaluating word embeddings based on subtitles. Published as "subs2vec: Word embeddings from subtitles in 55 languages" in Behavior Research Methods.
☆33Updated 4 years ago
Alternatives and similar repositories for subs2vec:
Users that are interested in subs2vec are comparing it to the libraries listed below
- Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo…☆45Updated last month
- English Small World of Words SWOWEN-2018☆66Updated 2 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- Sentiment Analysis and Cognition Engine (text analysis tool)☆19Updated 4 years ago
- Package to extract connotation frames☆85Updated last year
- A list of publicly available data sets from psycholinguistic studies☆31Updated 8 years ago
- A simple vector space model based tool for sentiment analysis of literary texts☆17Updated 7 months ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity☆25Updated last year
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Code for auto-generating maze distractors and running maze in ibex☆23Updated 9 months ago
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆54Updated 2 years ago
- Python Multilingual Ucrel Semantic Analysis System☆32Updated 8 months ago
- The Extended Moral Foundations Dictionary (E-MFD)☆40Updated 4 years ago
- Driver for LIWC2015 analysis. LIWC2015 dictionary not included.☆16Updated 2 years ago
- Data on verb transitivity in English and script to extract transitivity information from Google's syntactic ngrams corpus☆11Updated 6 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- Fast, flexible extraction of moral information from textual input data.☆108Updated last year
- Gender stereotypes are reflected in the distributional structure of 25 languages☆17Updated 2 years ago
- A python package for the Linguistic Inquiry and Word Count (LIWC) dictionary.☆40Updated 4 years ago
- PennController is a library extension for IBEX. It introduces a flexible and user-friendly syntax to design dynamic (e.g., scripted/timed…☆23Updated 2 years ago
- A psycholinguistic modeling toolkit☆27Updated 2 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆30Updated 2 months ago
- ☆27Updated 2 years ago
- Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts☆16Updated this week
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 5 months ago
- Methods and Measures for Semantic Network Analysis☆23Updated last year
- ☆22Updated 4 years ago
- Neural Language Models for Historical Research☆26Updated 6 months ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆53Updated 4 years ago
- Data, codebook, and models to automatically detect storytelling.☆18Updated 2 weeks ago