piantado / ngrampy
Tools in Python for dealing with Google Books Ngram files and other similar data sets.
☆18 Updated 10 years ago
Alternatives and similar repositories for ngrampy:
Users who are interested in ngrampy are comparing it to the libraries listed below:
- Switchboard Dialog Act Corpus with Penn Treebank links ☆144 Updated 4 years ago
- Language Acquisition Research Tools ☆41 Updated last year
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆30 Updated last month
- The Arborator software is aimed at collaboratively annotating dependency corpora. ☆26 Updated 5 years ago
- Featurize words into orthographic and phonological vectors. ☆40 Updated last year
- Modern SQLAlchemy wrappers for the MRC psycholinguistics database ☆21 Updated 10 years ago
- Python for Linguists – a Gentle Introduction to Programming ☆44 Updated 9 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia ☆73 Updated 10 years ago
- maximum entropy based part-of-speech tagger for NLTK ☆45 Updated 8 years ago
- Corpus of naturalistic stories with annotation and psycholinguistic measures ☆53 Updated 3 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆41 Updated last year
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses) ☆65 Updated 2 years ago
- linguistics backend ☆41 Updated 2 years ago
- English Small World of Words SWOWEN-2018 ☆66 Updated 2 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora. ☆33 Updated 6 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit ☆70 Updated 10 years ago
- ☆34 Updated 8 years ago
- ☆97 Updated 3 years ago
- A list of publicly available data sets from psycholinguistic studies ☆31 Updated 8 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases ☆45 Updated 5 years ago
- Scripts and tools for doing unsupervised acceptability prediction. ☆15 Updated 2 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies ☆70 Updated 6 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆151 Updated last year
- Dict2vec is a framework to learn word embeddings using lexical dictionaries. ☆114 Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- Linguistica 5: Unsupervised Learning of Linguistic Structure ☆30 Updated 5 years ago
- Cross-lingual metaphor detection. ☆66 Updated 5 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface) ☆33 Updated 3 years ago
- Sentence specificity prediction ☆25 Updated 6 years ago
- Code for auto-generating maze distractors and running maze in ibex ☆23 Updated 8 months ago