langtech-bsc / AnonymizationPipelineLinks
Anonymization Pipeline for injesting data from outside of BSC that contains GDPR protected data.
☆14Updated 2 years ago
Alternatives and similar repositories for AnonymizationPipeline
Users that are interested in AnonymizationPipeline are comparing it to the libraries listed below
Sorting:
- Pre-production releases for Spacy in Catalan☆14Updated 3 years ago
- ☆15Updated last year
- Fact checking baseline combining dense retrieval and textual entailment☆30Updated 3 months ago
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.☆35Updated 5 months ago
- NERO-nlp is a PyPI package for biomedical Named Entity (Recognition) Ontology☆12Updated 5 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆63Updated last year
- Generalist and Lightweight Model for Text Classification☆164Updated 5 months ago
- spaCyTurk - trained models & pipelines for Turkish☆22Updated 3 years ago
- Natural Language to SQL Queries in the OMOP CDM Datasets☆11Updated 2 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome…☆12Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Framework for working with brat-annotated .ann files☆10Updated 6 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆72Updated last month
- This repository contains a corpus of medical case reports with entity and relation annotations in BioC format.☆18Updated 5 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆69Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- ☆47Updated 2 years ago
- ☆43Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- ☆23Updated 2 years ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- Biomedical Data-to-Text Generation via Fine-Tuning Transformers☆29Updated 3 years ago
- The robust European language model benchmark.☆135Updated this week
- Detecting Bias and ensuring Fairness in AI solutions☆102Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 2 months ago
- ☆23Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆213Updated 2 months ago