byungdoh / llm_surprisal
Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Amount, and Surprisal’s Fit to Reading Times," EACL24)
☆14Updated last year
Alternatives and similar repositories for llm_surprisal:
Users that are interested in llm_surprisal are comparing it to the libraries listed below
- SAP Benchmark☆16Updated 7 months ago
- A psycholinguistic modeling toolkit☆27Updated 2 months ago
- Structural Supervision & Human Psycholinguistic Data☆10Updated 4 years ago
- ☆12Updated 3 years ago
- A neural language model that estimates incremental processing complexity☆39Updated 3 years ago
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"☆28Updated 3 years ago
- Code and Results for "Universals of word order reflect optimization of grammars for efficient communication"☆15Updated 2 years ago
- Corpus of naturalistic stories with annotation and psycholinguistic measures☆53Updated 3 years ago
- A simple, Python-based, command-line runner for MGIZA++.☆11Updated 3 years ago
- ☆18Updated 3 years ago
- ☆16Updated 3 years ago
- ☆24Updated last year
- Analysis pipeline for Revisiting UID (EMNLP 2021)☆11Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- ☆22Updated 3 years ago
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring☆10Updated last year
- Automated Semantic Analysis of Discourse Markers☆10Updated 2 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆30Updated 2 months ago
- ☆23Updated 2 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- Collection of available data sources for cognitively-inspired NLP☆38Updated 4 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- Data and code for "Nibbling at the Hard Core of Word Sense Disambiguation" (ACL 2022).☆15Updated 3 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆24Updated 11 months ago
- Organized inventory of research using the Abstract Meaning Representation☆37Updated this week
- Data Sets and Models for Evaluation of Lexical Semantic Change Detection☆28Updated 2 years ago
- Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"☆13Updated 2 years ago
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- A framework for nonlinear continuous-time regression☆35Updated 3 months ago
- This repository contains the Potsdam Textbook Corpus (PoTeC) which is a natural reading eye-tracking corpus.☆12Updated 3 months ago