This is a simple Python package for calculating a variety of lexical diversity indices
☆82Sep 15, 2023Updated 2 years ago
Alternatives and similar repositories for lexical_diversity
Users that are interested in lexical_diversity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Keywords: lexical diversity MTLD HDD vocabulary type token python☆17Apr 26, 2017Updated 8 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆112Aug 27, 2023Updated 2 years ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity☆31Nov 4, 2023Updated 2 years ago
- ☆24Aug 24, 2023Updated 2 years ago
- 🖋 Resource and Tool for Writing System Identification (Unicode 17.0) -- LREC 2024☆21Feb 17, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Comparing sequential forecasters via confidence sequences & e-processes☆10Oct 24, 2023Updated 2 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Apr 8, 2022Updated 3 years ago
- Corpus of Annotations for Misspelings☆28Jul 31, 2023Updated 2 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆24Jan 12, 2025Updated last year
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆29Feb 21, 2026Updated last month
- [BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news dete…☆149Dec 3, 2024Updated last year
- 基于Chinese Open Wordnet实现上下位关系自动抽取☆12May 15, 2020Updated 5 years ago
- Ancient greek dictionary☆12Feb 14, 2016Updated 10 years ago
- Providing a reactivity system similar to Vue.js for Python.☆16Sep 28, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Extract links from Wikipedia pages to create a cross-document coreference dataset (multilingual support)☆11Apr 13, 2023Updated 2 years ago
- Exploring the idea of a generic, language agnostic, CEFR level classifier☆23Apr 13, 2018Updated 7 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆61May 31, 2023Updated 2 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆30Sep 20, 2025Updated 6 months ago
- End-to-end shallow discourse parser☆25Jun 12, 2023Updated 2 years ago
- Modified Python3 P2FA for Mandarin☆10Sep 21, 2020Updated 5 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆20Feb 8, 2026Updated last month
- NILC-Metrix gathers the metrics developed over more than a decade in NILC Lab.☆15Feb 23, 2026Updated last month
- Perspectrum: a dataset of claims, perspectives and evidence documents☆34Jan 16, 2020Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆16Dec 6, 2023Updated 2 years ago
- Chrome Extension that adds an active forks section on a Github Page☆14Mar 19, 2022Updated 4 years ago
- ☆11Jun 14, 2022Updated 3 years ago
- Notebook which provides an overview to several text summarization techniques☆11Mar 22, 2019Updated 7 years ago
- Entity and syntax experiments for assessing coherence☆27Nov 12, 2018Updated 7 years ago
- load word embeddings to Torch.Tensor☆14May 12, 2016Updated 9 years ago
- ☆12Apr 2, 2024Updated last year
- An easy-to-use library to extract indices from texts.☆30Sep 7, 2021Updated 4 years ago
- Repository for Vajjala & Lucic (2018)☆67Feb 15, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- Tropy plugin to import IIIF manifests☆17Mar 11, 2026Updated 2 weeks ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆402Sep 15, 2024Updated last year
- Findings in ACL 2023☆10Dec 5, 2023Updated 2 years ago
- A NLP team project for finding the cross bilingual embeddings (dictionary) for English and Hindi.☆10Feb 23, 2018Updated 8 years ago
- code for "GLEN: General-Purpose Event Detection for Thousands of Types"☆13Nov 6, 2023Updated 2 years ago