fredriko / nlp-data-readiness
This is a document concerning Data Readiness in the context of machine learning and Natural Language Processing.
☆11, updated 3 years ago
Related projects
Alternatives and complementary repositories for nlp-data-readiness
- Finds linguistic patterns effortlessly ☆33, updated last year
- Training a model without a dataset for natural language inference (NLI) ☆25, updated 4 years ago
- Bots for reviewing the credibility of web content: articles, tweets, sentences and websites ☆9, updated last year
- Extracting narrative timelines (i.e. order and timing of events) from text ☆20, updated 5 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36, updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14, updated last year
- An implementation of GrASP (Shnarch et al., 2017) ☆21, updated 2 years ago
- Converter from UD-trees to BART representation ☆36, updated 8 months ago
- Easy-to-use text representations extraction library based on the Transformers library. ☆32, updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆57, updated 7 months ago
- ✨ Web interface for NeuralCoref coreference resolution ☆34, updated last year
- The News Landscape Toolkit (NELA) ☆15, updated 4 years ago
- A web interface to understand language-specific BERT models ☆17, updated 6 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆85, updated last month
- A PyPI package for easy text annotation in a Jupyter Notebook. ☆27, updated 3 years ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆31, updated 4 months ago
- Data programming by demonstration for information extraction and span annotation ☆35, updated 3 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13, updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆25, updated 2 years ago
- Wikidata embedding ☆50, updated last week
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19, updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆91, updated last year
- Numeric fused-head identification and resolution ☆33, updated 5 years ago
- Rich Context leaderboard competition, including the corpus and current SOTA for required tasks. ☆21, updated 3 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations. ☆15, updated 5 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19, updated last year
- Sequence tagging with spaCy and crfsuite ☆18, updated last year
- Dynamic ensemble decoding with transformer-based models ☆29, updated last year
- Neural multi-doc question answering on the CORD-19 dataset ☆10, updated 4 years ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re… ☆12, updated last month