byungdoh / llm_surprisal
Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Amount, and Surprisal’s Fit to Reading Times," EACL24)
☆13Updated last year
Alternatives and similar repositories for llm_surprisal:
Users that are interested in llm_surprisal are comparing it to the libraries listed below
- SAP Benchmark☆15Updated 6 months ago
- A psycholinguistic modeling toolkit☆26Updated 3 weeks ago
- ☆12Updated 2 years ago
- Structural Supervision & Human Psycholinguistic Data☆10Updated 3 years ago
- ☆16Updated 3 years ago
- A neural language model that estimates incremental processing complexity☆39Updated 3 years ago
- Corpus of naturalistic stories with annotation and psycholinguistic measures☆52Updated 3 years ago
- ☆22Updated 3 years ago
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring☆11Updated last year
- A simple, Python-based, command-line runner for MGIZA++.☆11Updated 3 years ago
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"☆26Updated 3 years ago
- Analysis pipeline for Revisiting UID (EMNLP 2021)☆11Updated 2 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 2 years ago
- Collection of available data sources for cognitively-inspired NLP☆39Updated 4 years ago
- Data Sets and Models for Evaluation of Lexical Semantic Change Detection☆28Updated 2 years ago
- ☆18Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated 2 years ago
- Code and Results for "Universals of word order reflect optimization of grammars for efficient communication"☆15Updated 2 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆21Updated 5 months ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se…☆19Updated 3 years ago
- ☆18Updated last year
- Data, codebook, and models to automatically detect storytelling.☆15Updated last year
- ☆24Updated 10 months ago
- Organized inventory of research using the Abstract Meaning Representation☆37Updated last week
- ☆19Updated 3 years ago
- Datasets for the Monolingual Word Sense Alignment (MWSA) task☆12Updated 4 years ago
- ☆29Updated last year
- ☆39Updated 3 years ago