Layout-Parser / annotation-service
☆16 Updated 3 years ago
Alternatives and similar repositories for annotation-service:
Users who are interested in annotation-service compare it with the libraries listed below:
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆15 Updated 6 months ago
- Finds linguistic patterns effortlessly ☆35 Updated last year
- 🌸 Train floret vectors ☆18 Updated last year
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data. ☆35 Updated 4 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆31 Updated 3 years ago
- Metadata Extractor & Loader (MEL) - The NLP-NER Toolkit (TNNT) ☆22 Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆26 Updated 2 years ago
- Post-processing OCR errors with seq2seq models ☆28 Updated 4 years ago
- ☆12 Updated 10 months ago
- Small Python package to measure OCR quality and other related metrics. ☆21 Updated last year
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier. ☆26 Updated 3 years ago
- Keras Implementation of Flair's Contextualized Embeddings ☆26 Updated 3 years ago
- 🔥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated 11 months ago
- ☆54 Updated last year
- Converter from UD-trees to BART representation ☆36 Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2… ☆13 Updated last year
- Large-scale query-focused multi-document Summarization dataset ☆10 Updated 3 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f… ☆24 Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆53 Updated last year
- Tool for parsing and converting various span encoding schemes. ☆22 Updated last year
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser… ☆44 Updated 5 months ago
- A simple library for training named entity recognition models from partially annotated data ☆23 Updated last year
- A Named-Entity Recogniser based on Grobid. ☆50 Updated 5 months ago
- Hybrid architecture media server, media service and Streamlit client app using FastAPI and Python ☆12 Updated 2 years ago
- Interpretable feature construction from taxonomies for text classification ☆18 Updated 2 years ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- A Python package to get useful information from documents using the TopicRank algorithm. ☆16 Updated last year
- Sequence tagging with spaCy and crfsuite ☆19 Updated last year