HPI-Information-Systems / Quagga
An email segmentation system (reference implementation of ECIR 2018 paper)
☆10Updated 4 years ago
Related projects: ⓘ
- An Email Segmentation System☆9Updated 3 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆17Updated 4 years ago
- Finds linguistic patterns effortlessly☆31Updated last year
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆83Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆90Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆53Updated last year
- TeXoo – A Zoo of Text Extractors☆18Updated 4 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆40Updated 5 months ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated last year
- Inter-annotator agreement for Doccano☆26Updated 4 years ago
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆34Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 5 years ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re…☆12Updated this week
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆69Updated last year
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆49Updated 4 years ago
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- Fast fuzzy text search☆11Updated last year
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆67Updated 3 years ago
- ☆64Updated last year
- 🚀GUI for training spaCy models☆53Updated 3 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆87Updated 2 years ago
- Named entity recognition for the legal domain☆40Updated 3 years ago
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling☆29Updated last year
- Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆54Updated 2 years ago
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 2 years ago
- Prodigy thing(z)☆13Updated 6 years ago