kristopherkyle / TAALED
Tool for the Automatic Assessment of Lexical Diversity
☆11Updated 3 years ago
Related projects: ⓘ
- ☆12Updated last year
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆11Updated 2 months ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆12Updated last year
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- ☆19Updated 2 years ago
- ☆19Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆30Updated 3 months ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆11Updated 3 years ago
- python package for calculating famous measures in computational linguistics☆13Updated 4 months ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated last year
- ☆13Updated 3 months ago
- ☆22Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Converter from UD-trees to BART representation☆37Updated 6 months ago
- ☆13Updated last month
- Large-scale query-focused multi-document Summarization dataset☆11Updated 2 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆12Updated last year
- ☆21Updated 2 months ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Tool for sentiment analysis annotation☆11Updated 6 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated last year
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re…☆12Updated this week
- Code for GenAug: Data Augmentation for Finetuning Text Generators.☆25Updated 2 years ago
- This repository implements the interaction with DBLP, information extraction and pre-processing of papers, and a client to store data to …☆10Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 4 months ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch…☆9Updated 3 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆18Updated 10 months ago