byungdoh / llm_surprisal
Surprisal calculation using HuggingFace LMs ("Frequency Explains the Inverse Correlation of Large Language Models’ Size, Training Data Amount, and Surprisal’s Fit to Reading Times," EACL24)
☆14Updated last year
Alternatives and similar repositories for llm_surprisal:
Users that are interested in llm_surprisal are comparing it to the libraries listed below
- SAP Benchmark☆16Updated 6 months ago
- ☆12Updated 2 years ago
- A psycholinguistic modeling toolkit☆27Updated last month
- Structural Supervision & Human Psycholinguistic Data☆10Updated 4 years ago
- Collection of available data sources for cognitively-inspired NLP☆39Updated 4 years ago
- ☆16Updated 3 years ago
- ☆18Updated 2 years ago
- Corpus of naturalistic stories with annotation and psycholinguistic measures☆53Updated 3 years ago
- Analysis pipeline for Revisiting UID (EMNLP 2021)☆11Updated 2 years ago
- A neural language model that estimates incremental processing complexity☆39Updated 3 years ago
- Code and Results for "Universals of word order reflect optimization of grammars for efficient communication"☆15Updated 2 years ago
- ☆24Updated 11 months ago
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring☆10Updated last year
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 2 years ago
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"☆27Updated 3 years ago
- This repository contains the Potsdam Textbook Corpus (PoTeC) which is a natural reading eye-tracking corpus.☆12Updated 2 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆30Updated last month
- ☆22Updated 3 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- ☆10Updated 2 weeks ago
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- Organized inventory of research using the Abstract Meaning Representation☆37Updated last week
- A unified interface for computing surprisal (log probabilities) from language models! Supports neural, symbolic, and black-box API models…☆38Updated 4 months ago
- ☆29Updated last year
- A simple, Python-based, command-line runner for MGIZA++.☆11Updated 3 years ago
- ☆18Updated last year
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Updated 10 months ago
- ☆19Updated 3 years ago
- Code and CoarseWSD-20 datasets for "Language Models and Word Sense Disambiguation: An Overview and Analysis"☆25Updated 2 years ago