fhamborg / NewsWCL50Links
The first, open access evaluation dataset for methods to identify bias by word choice and labeling
☆25Updated last month
Alternatives and similar repositories for NewsWCL50
Users that are interested in NewsWCL50 are comparing it to the libraries listed below
Sorting:
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- ☆16Updated 7 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- Wikidata embedding☆51Updated last year
- ✨ Web interface for NeuralCoref coreference resolution☆34Updated 2 years ago
- A web interface to understand language-specific BERT-models☆18Updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 10 months ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Updated 7 years ago
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆62Updated last year
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.☆33Updated 7 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 7 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 5 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- spaCy-to-naf converter☆21Updated 6 months ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆155Updated last year
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- Agents that build knowledge graphs and explore textual worlds by asking questions☆79Updated 2 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Updated last year
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 7 years ago
- Keras implementation of ontology aware token embeddings☆49Updated 7 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 5 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 3 years ago
- ☆59Updated 10 years ago
- The WebSplit Benchmark introducing "Split and Rephrase" task☆63Updated 7 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 3 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆23Updated 3 years ago
- Dataset containing Aggregated and anonymized queries from across the world with Coronavirus intent.☆85Updated 4 years ago