lpmi-13 / machine_readable_wordlists
A collection of word lists in machine-readable, web-native (.yml and .json) formats
☆23 Updated 2 years ago
Alternatives and similar repositories for machine_readable_wordlists
Users who are interested in machine_readable_wordlists are comparing it to the repositories listed below.
- Gather modern English word frequencies from all enwiki articles. ☆220 Updated last year
- Lexical database for ~70k English words with morphological variables ☆44 Updated 3 years ago
- Open Language Profiles: English profile datasets from CEFR-J ☆136 Updated 5 years ago
- ☆64 Updated last year
- Linguistic and stylistic complexity measures for (literary) texts ☆82 Updated last year
- Natural Language Processing Research in North American Linguistics Departments ☆20 Updated 4 months ago
- Utility for behavioral and representational analyses of Language Models ☆153 Updated 2 weeks ago
- This is a monolingual English corpus of native, non-native and (human) translated texts extracted from the European Parliament. ☆9 Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆35 Updated 4 months ago
- Sentence aligner ☆115 Updated 4 years ago
- A Python Wiktionary Parser ☆361 Updated 5 months ago
- A repository for the 2022 Inflection Shared Task ☆9 Updated 3 years ago
- A list of vocabulary lists ☆21 Updated 5 years ago
- Framework for training dependency parsing models. ☆11 Updated last year
- Runnable morphological analysis tools from the UniMorph project ☆16 Updated 6 years ago
- The University of Pittsburgh English Language Institute Corpus (PELIC) dataset ☆24 Updated 2 years ago
- Python for Linguists – a Gentle Introduction to Programming ☆45 Updated 9 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆490 Updated 8 months ago
- Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Am… ☆16 Updated last year
- A Python module for English lemmatization and inflection. ☆268 Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆197 Updated 4 years ago
- A toolkit to create, launch and monitor SLURM jobs over existing Python scripts. ☆12 Updated last year
- English data ☆211 Updated 2 weeks ago
- Jupyter notebooks for course "Computational Morphology with HFST". ☆18 Updated 2 years ago
- The Open English WordNet ☆588 Updated 3 weeks ago
- Open German WordNet ☆96 Updated last year
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆109 Updated 6 years ago
- The UC Davis Corpus of Written Spanish, L2 and Heritage Speakers ☆17 Updated 3 months ago
- Neural Adaptive Machine Translation that adapts to context and learns from corrections. ☆349 Updated 3 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis ☆32 Updated 5 years ago