A module to compute textual lexical richness (aka lexical diversity).
☆112 · Aug 27, 2023 · Updated 2 years ago
Alternatives and similar repositories for LexicalRichness
Users that are interested in LexicalRichness are comparing it to the libraries listed below.
- This is a simple Python package for calculating a variety of lexical diversity indices ☆82 · Sep 15, 2023 · Updated 2 years ago
- Tool for the automatic assessment of lexical diversity ☆14 · Sep 6, 2025 · Updated 7 months ago
- Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models" ☆14 · May 27, 2024 · Updated last year
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection" ☆12 · Oct 20, 2024 · Updated last year
- A general-purpose NLP pipeline for Ancient Greek ☆28 · Mar 26, 2024 · Updated 2 years ago
- python package for calculating famous measures in computational linguistics ☆15 · Nov 5, 2024 · Updated last year
- Whisper finetuning ☆16 · Apr 9, 2025 · Updated last year
- An easy-to-use library to extract indices from texts. ☆30 · Sep 7, 2021 · Updated 4 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021) ☆13 · Jun 2, 2021 · Updated 4 years ago
- Ancient greek dictionary ☆12 · Feb 14, 2016 · Updated 10 years ago
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam ☆11 · Sep 7, 2021 · Updated 4 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub. ☆11 · Jun 23, 2024 · Updated last year
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles ☆11 · Dec 13, 2018 · Updated 7 years ago
- Universal Dependency Treebanks in Korean ☆38 · Dec 19, 2021 · Updated 4 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆28 · Nov 30, 2024 · Updated last year
- ☆15 · Dec 1, 2020 · Updated 5 years ago
- Common Lisp Package for Parallel Corpus Processing ☆13 · Feb 17, 2024 · Updated 2 years ago
- The Ancient Greek dictionary for Hunspell (grc_GR for Notepad++, Google Chrome, Vivaldi etc). ☆13 · May 18, 2022 · Updated 3 years ago
- Demo server for TREC LiveQA competition ☆11 · Dec 7, 2016 · Updated 9 years ago
- ☆15 · Oct 4, 2024 · Updated last year
- A phonics API for the English language. ☆14 · Oct 25, 2015 · Updated 10 years ago
- Guide for the slp group on how to use the Grnet cluster ☆11 · Apr 16, 2020 · Updated 6 years ago
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods" ☆13 · Nov 26, 2024 · Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆63 · Apr 30, 2024 · Updated 2 years ago
- The main controller for services in the cs-insights project through docker-compose. ☆13 · Aug 25, 2023 · Updated 2 years ago
- Code for the ILNewsDiff Twitter account ☆10 · May 23, 2023 · Updated 2 years ago
- GW2 inventory cleanup tool ☆16 · Apr 5, 2025 · Updated last year
- [NeurIPS 2021] Open Rule Induction ☆19 · May 22, 2022 · Updated 3 years ago
- The University of Pittsburgh English Language Institute Corpus (PELIC) dataset ☆28 · Mar 6, 2026 · Updated last month
- Fragments-Expert is a software package for feature extraction from file fragments and classification among various file formats. ☆13 · Jan 16, 2024 · Updated 2 years ago
- Dataset of the Samaritan Pentateuch ☆12 · Apr 15, 2026 · Updated 2 weeks ago
- Converting the Enron email collection to mbox format ☆11 · Dec 9, 2016 · Updated 9 years ago
- Transform Greek and Latin texts into morphology databases using Perseus' Morpheus service. ☆17 · Aug 8, 2014 · Updated 11 years ago
- In this project, word and document embeddings are generated for the sentiment classification task. ☆10 · May 29, 2025 · Updated 11 months ago
- Code for the ACL 2022 paper "Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning" ☆37 · Dec 5, 2022 · Updated 3 years ago
- Reference list of email processing resources; focus on preservation and PII handling ☆15 · Apr 20, 2022 · Updated 4 years ago
- Tropy plugin to import IIIF manifests ☆17 · Mar 11, 2026 · Updated last month
- Finds snippets in iambic pentameter in English-language text and tries to combine them to a rhyming sonnet. ☆13 · Jan 5, 2023 · Updated 3 years ago
- Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020 ☆16 · Mar 21, 2025 · Updated last year