kristopherkyle / TAALED
Tool for the Automatic Assessment of Lexical Diversity
☆11Updated 4 years ago
Alternatives and similar repositories for TAALED:
Users that are interested in TAALED are comparing it to the libraries listed below
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated 7 months ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆13Updated last year
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- ☆14Updated 4 months ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- ☆17Updated last year
- ☆21Updated 3 weeks ago
- ☆8Updated 7 months ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re…☆12Updated 5 months ago
- ☆19Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated 6 months ago
- LEMON: Explainable Entity Matching☆18Updated 2 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- ☆11Updated 2 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- Converter from UD-trees to BART representation☆36Updated 11 months ago
- ☆12Updated last year
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated last month
- GASP! Dataset - Generating Abstracts of Scientific Papers from Abstracts of Cited Papers☆9Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆32Updated 8 months ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Neural multi-doc question answering on the CORD-19 dataset☆10Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- ☆22Updated 7 months ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago