fhamborg / NewsWCL50
The first open-access evaluation dataset for methods to identify bias by word choice and labeling
☆25Updated 2 years ago
Alternatives and similar repositories for NewsWCL50
Users who are interested in NewsWCL50 are comparing it to the libraries listed below.
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- An implementation of GrASP (Shnarch et al., 2017)☆21Updated 3 years ago
- A simple neural truecaser written in PyTorch and AllenNLP.☆33Updated last year
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python☆111Updated 3 months ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 3 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 3 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆62Updated last year
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated 2 months ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Updated last year
- A web application for tagging and retrieval of arguments in text☆29Updated 2 years ago
- Dictionaries of names, surnames, acronyms and their extensions, stop-words, etc., which I gathered for different experiments.☆28Updated 8 years ago
- German GPT-2 model☆32Updated 4 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state of …☆61Updated 5 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆16Updated 5 years ago
- Numeric fused-head identification and resolution☆33Updated 5 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆82Updated last year
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆57Updated 10 months ago
- Harassment Lexicon and Corpus☆30Updated 7 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆17Updated 5 years ago
- Text Similarity Search Application using Modern NLP and Elasticsearch☆30Updated 5 years ago
- Getting started with AllenNLP and PyTorch by training a tweet classifier☆66Updated 7 years ago
- Use BERT to Fill in the Blanks☆83Updated 3 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆98Updated 3 years ago
- ✨ Web interface for NeuralCoref coreference resolution☆34Updated 2 years ago
- Keras implementation of ontology aware token embeddings☆49Updated 6 years ago
- An embeddable annotation tool for end-to-end cross-document coreference☆42Updated 2 years ago