LSYS/LexicalRichness

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LSYS/LexicalRichness)

LSYS / LexicalRichness

A module to compute textual lexical richness (aka lexical diversity).

☆113

Alternatives and similar repositories for LexicalRichness

Users that are interested in LexicalRichness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kristopherkyle / lexical_diversity
View on GitHub
This is a simple Python package for calculating a variety of lexical diversity indices
☆84Sep 15, 2023Updated 2 years ago
vivek3141 / ghostbuster-data
View on GitHub
Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"
☆14May 27, 2024Updated 2 years ago
jpwahle / emnlp23-paraphrase-types
View on GitHub
The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"
☆12Oct 20, 2024Updated last year
vered1986 / panic
View on GitHub
PANiC - PAraphrasing Noun-Compounds
☆15Apr 6, 2018Updated 8 years ago
WSE-research / LinguaF
View on GitHub
python package for calculating famous measures in computational linguistics
☆15Jun 29, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aws-solutions-library-samples / guidance-for-media-extraction-and-dynamic-content-policy-framework-on-aws
View on GitHub
This Guidance demonstrates how to accelerate your content analysis workflows by automating video metadata extraction, intelligence gather…
☆13Updated this week
josauder / dreambank_visualized
View on GitHub
DreamBank Visualized - An interactive visualization of over 26,000 dream transcriptions
☆16Jun 16, 2018Updated 8 years ago
mcdonn / LSA2019-Reproducible-Research
View on GitHub
Satellite Workshop "Tools for Reproducible Research in Linguistics" at the 93rd Annual Meeting of the Linguistics Society of America in N…
☆18Jan 8, 2025Updated last year
thegetty / mutual-muses
View on GitHub
Mutual Muses is a crowdsourced transcription project undertaken by the Digital Art History program at the Getty Research Institute
☆17May 3, 2018Updated 8 years ago
emorynlp / ud-korean
View on GitHub
Universal Dependency Treebanks in Korean
☆39Dec 19, 2021Updated 4 years ago
mullerpeter / authorstyle
View on GitHub
Python package to deal with PAN corpora and extract stylometric features from text documents.
☆15Nov 11, 2022Updated 3 years ago
georgepar / grnet_guide
View on GitHub
Guide for the slp group on how to use the Grnet cluster
☆11Apr 16, 2020Updated 6 years ago
alex-mcleod / py_liwc
View on GitHub
A python implementation of the LIWC program (http://www.liwc.net/).
☆14Feb 26, 2013Updated 13 years ago
MeLeLBGU / SaGe
View on GitHub
Code for SaGe subword tokenizer (EACL 2023)
☆28Nov 30, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
cligs / toolbox
View on GitHub
Collection of small tools for text processing.
☆25Feb 18, 2023Updated 3 years ago
heolin / agreement
View on GitHub
Implementation of popular agreement metrics such as Cohen kappa, Fleiss kappa, Krippendorff alpha
☆16Apr 2, 2022Updated 4 years ago
mitramir55 / PassivePy
View on GitHub
PassivePy: A Tool to Automatically Identify Passive Voice in Big Text Data
☆23Mar 6, 2024Updated 2 years ago
collabora / whisper-finetuning
View on GitHub
Whisper finetuning
☆17Apr 9, 2025Updated last year
lucy3 / whos_filtered
View on GitHub
☆15Oct 4, 2024Updated last year
scrosseye / CLEAR-Corpus
View on GitHub
Repository for the CommonLit Ease of Readability Corpus
☆25Apr 17, 2024Updated 2 years ago
ELI-Data-Mining-Group / PELIC-dataset
View on GitHub
The University of Pittsburgh English Language Institute Corpus (PELIC) dataset
☆28Mar 6, 2026Updated 4 months ago
malteos / aspect-document-similarity
View on GitHub
Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020
☆63Apr 30, 2024Updated 2 years ago
cadia-lvl / samromur-asr
View on GitHub
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12Sep 30, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
amatsuo / kaigiroku
View on GitHub
☆11Jun 14, 2022Updated 4 years ago
jpwahle / cs-insights
View on GitHub
The main controller for services in the cs-insights project through docker-compose.
☆13Aug 25, 2023Updated 2 years ago
lintool / Enron2mbox
View on GitHub
Converting the Enron email collection to mbox format
☆12Dec 9, 2016Updated 9 years ago
thefullstackninja / Streamlit_tutorials
View on GitHub
This repo is all about creating sample apps with Streamlit.
☆13Oct 25, 2020Updated 5 years ago
AI4Bharat / IndicMFA
View on GitHub
☆18Sep 13, 2024Updated last year
heartcored98 / transformer_anatomy
View on GitHub
Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020
☆16Mar 21, 2025Updated last year
charlesweir / LUThesisTemplate
View on GitHub
Word template for a Lancaster University thesis
☆10Mar 19, 2022Updated 4 years ago
H-TayyarMadabushi / CxGBERT-BERT-meets-Construction-Grammar
View on GitHub
Construction Grammar based BERT
☆14Dec 5, 2020Updated 5 years ago
shuizhonghaitong / classification_GAT
View on GitHub
用唐诗知识图谱、带标签的诗词作输入 2层GAT+attention 唐诗题材分类 Tensorflow框架
☆14Oct 29, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tompkinsguitar / counterpoint
View on GitHub
Creates first-species modal counterpoint
☆14Jan 10, 2019Updated 7 years ago
abhimanyudubey / GeoYFCC
View on GitHub
Dataset accompanying the paper "Adaptive Methods for Real-World Domain Generalization"
☆16Aug 17, 2023Updated 2 years ago
pariajm / english-fisher-annotations
View on GitHub
A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset
☆13May 2, 2021Updated 5 years ago
viirya / flickr_fetcher
View on GitHub
Research codes for image interestingness
☆17Dec 6, 2017Updated 8 years ago
KaniyamFoundation / Pdf2Text
View on GitHub
Project to convert PDF files to Text files using google OCR
☆13May 6, 2024Updated 2 years ago
tuxxy / entropy
View on GitHub
A utility that calculates the Shannon entropy of a given input file
☆14Mar 15, 2022Updated 4 years ago
PacktPublishing / Bash-Scripting-and-Shell-Programming-Linux-Command-Line-
View on GitHub
Code Repository for Bash Scripting and Shell Programming (Linux Command Line), Published by Packt
☆12Jan 30, 2023Updated 3 years ago