davidjurgens / potato
potato: portable text annotation tool
☆296Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for potato
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆167Updated last year
- ☆293Updated last year
- Code, data, and models for "POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection"☆29Updated 3 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆188Updated 2 months ago
- ☆208Updated 8 months ago
- Tools for checking ACL paper submissions☆598Updated 2 weeks ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆101Updated 9 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆324Updated 2 years ago
- HDBSCAN Tuning for BERTopic Models☆42Updated last year
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆258Updated last month
- ☆190Updated 5 months ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆34Updated last year
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆89Updated last year
- All-in-one text de-duplication☆618Updated 5 months ago
- A Framework for Textual Entailment based Zero Shot text classification☆154Updated 7 months ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi…☆100Updated 9 months ago
- Data and models for the SciFact verification task.☆225Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆193Updated 8 months ago
- multimodal document analysis☆159Updated 5 months ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆284Updated last year
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆289Updated 2 years ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆347Updated 6 months ago
- A reading list of up-to-date papers on NLP for Social Good.☆283Updated last year
- ☆229Updated 3 years ago
- ☆202Updated last year
- Automatically detect errors in annotated corpora.☆47Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆457Updated last month
- Pretraining Efficiently on S2ORC!☆136Updated 2 weeks ago
- Codebase, data and models for the SummaC paper in TACL☆85Updated 10 months ago
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper…☆105Updated last month