fredriko / nlp-data-readiness
This is a document concerning Data Readiness in the context of machine learning and Natural Language Processing.
☆12 Updated 4 years ago
Alternatives and similar repositories for nlp-data-readiness
Users who are interested in nlp-data-readiness are comparing it to the repositories listed below.
- German GPT-2 model ☆32 Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆82 Updated last year
- On Generating Extended Summaries of Long Documents ☆78 Updated 4 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 Updated 3 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 Updated 4 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook. ☆28 Updated 4 years ago
- Wikidata embedding ☆51 Updated 11 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge. ☆69 Updated 4 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ☆32 Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 3 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆70 Updated 4 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆148 Updated last year
- Passive/Active sentence Transformer ☆28 Updated 7 years ago
- Google USE (Universal Sentence Encoder) for spaCy ☆184 Updated 2 years ago
- ☆64 Updated 2 years ago
- An embeddable annotation tool for end-to-end cross-document coreference ☆42 Updated 2 years ago
- Language Model and Text Classification for German Language using Deep Learning ☆18 Updated 7 years ago
- A large (>5k) collection of search questions asked about Coronavirus 🦠 ☆14 Updated 5 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆63 Updated last year
- Running Prodigy for a team of annotators ☆53 Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 Updated 4 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆152 Updated this week
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings ☆95 Updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆62 Updated last year
- spaCy + UDPipe ☆163 Updated 3 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python ☆112 Updated 4 months ago
- SummVis is an interactive visualization tool for text summarization. ☆253 Updated 3 years ago
- Sentence transformers models for spaCy ☆107 Updated 2 years ago