davidjurgens / potato
potato: portable text annotation tool
☆330Updated 2 weeks ago
Alternatives and similar repositories for potato
Users that are interested in potato are comparing it to the libraries listed below
Sorting:
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆180Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆218Updated 5 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆349Updated 2 years ago
- ☆52Updated last year
- ☆291Updated 2 years ago
- Interpretable Evaluation for AI Systems☆366Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL☆93Updated 3 months ago
- ☆34Updated 7 months ago
- Data and models for the SciFact verification task.☆232Updated last year
- Package to extract connotation frames☆85Updated last year
- ☆23Updated 2 years ago
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆288Updated 2 years ago
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆288Updated 7 months ago
- A Python Search Engine for Humans 🥸☆218Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆123Updated 8 months ago
- GEMBA — GPT Estimation Metric Based Assessment☆118Updated 9 months ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆181Updated 2 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆104Updated last year
- All-in-one text de-duplication☆674Updated 11 months ago
- ☆92Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆206Updated last year
- A Topic Modeling System Toolkit (ACL 2024 Demo)☆250Updated last month
- multimodal document analysis☆164Updated 11 months ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆82Updated last year
- ☆39Updated last year
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24)☆290Updated last month
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆95Updated last year
- Pretraining Efficiently on S2ORC!☆163Updated 6 months ago
- Shared task hosted by IBM in the ArgMining workshop in EMNLP☆29Updated 3 years ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆37Updated last year