davidjurgens / potato
potato: portable text annotation tool
☆323Updated this week
Alternatives and similar repositories for potato:
Users that are interested in potato are comparing it to the libraries listed below
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆178Updated last year
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆285Updated 5 months ago
- Package to extract connotation frames☆83Updated last year
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24)☆273Updated last week
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆213Updated 4 months ago
- ☆52Updated last year
- Pretraining Efficiently on S2ORC!☆158Updated 5 months ago
- ☆293Updated 2 years ago
- All-in-one text de-duplication☆664Updated 10 months ago
- A Python Search Engine for Humans 🥸☆212Updated 11 months ago
- ☆158Updated 9 months ago
- Data and models for the SciFact verification task.☆228Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆104Updated last year
- ☆23Updated last year
- FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning. Presented at EACL 2023.☆27Updated last year
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- String-to-String Algorithms for Natural Language Processing☆541Updated 7 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆359Updated 11 months ago
- A reading list of up-to-date papers on NLP for Social Good.☆301Updated last year
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆35Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆124Updated last year
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆95Updated last year
- ☆47Updated 2 years ago
- ☆38Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆118Updated 7 months ago
- Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).☆476Updated last month
- Resources for cultural NLP research☆86Updated 2 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆345Updated 2 years ago
- ☆34Updated 5 months ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆390Updated 9 months ago