langtech-bsc / AnonymizationPipelineLinks
Anonymization Pipeline for injesting data from outside of BSC that contains GDPR protected data.
☆17Updated 2 years ago
Alternatives and similar repositories for AnonymizationPipeline
Users that are interested in AnonymizationPipeline are comparing it to the libraries listed below
Sorting:
- Pre-production releases for Spacy in Catalan☆14Updated 4 years ago
- ☆23Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆81Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- ☆43Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆60Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- MAFAND-MT☆60Updated last year
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 4 years ago
- A High-level Library for Named Entity Recognition in Python.☆25Updated 2 years ago
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Updated 3 weeks ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆70Updated 3 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.☆38Updated 8 months ago
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆169Updated 2 weeks ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome…☆12Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆170Updated 3 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆29Updated 4 years ago
- Framework for working with brat-annotated .ann files☆10Updated last month
- ☆17Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- Detecting Bias and ensuring Fairness in AI solutions☆102Updated 2 years ago
- NERO-nlp is a PyPI package for biomedical Named Entity (Recognition) Ontology☆12Updated 5 years ago