HPI-Information-Systems / Quagga
An email segmentation system (reference implementation of ECIR 2018 paper)
☆10Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Quagga
- An Email Segmentation System☆9Updated 4 years ago
- Finds linguistic patterns effortlessly☆33Updated last year
- TeXoo – A Zoo of Text Extractors☆18Updated 4 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆17Updated 4 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated 2 years ago
- Named entity recognition for the legal domain☆40Updated 3 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆50Updated 4 years ago
- A small Python library for NLP Interchange Format (NIF) for NER(D) systems☆19Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago
- ☆16Updated 9 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆40Updated last month
- ☆19Updated 6 years ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆19Updated 4 months ago
- Wayward is a Python package that helps to identify characteristic terms from single documents or groups of documents. It can be used for …☆9Updated 5 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆85Updated 3 years ago
- Collaborative Synchronized Corpus Annotation Tool☆10Updated 5 years ago
- spaCy-to-naf converter☆21Updated 5 months ago
- ☆18Updated 3 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- Fast fuzzy text search☆11Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆19Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆78Updated 2 months ago
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- For extracting measurements and related entities from text☆56Updated 4 years ago