Anonymization Pipeline for injesting data from outside of BSC that contains GDPR protected data.
β16Nov 10, 2023Updated 2 years ago
Alternatives and similar repositories for AnonymizationPipeline
Users that are interested in AnonymizationPipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)β16Sep 29, 2024Updated last year
- π A Prodigy plugin for evaluating spaCy pipelinesβ13Mar 26, 2024Updated last year
- A light-weighted UMLS-based data augmentation for biomedical NLP tasks including Named Entity Recognition and sentence classification.β10Apr 6, 2021Updated 4 years ago
- Pre-production releases for Spacy in Catalanβ14Nov 30, 2021Updated 4 years ago
- Fast word segmentation with a focus on splitting #hashtagsβ14Sep 29, 2021Updated 4 years ago
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)β18Oct 13, 2022Updated 3 years ago
- β17Sep 24, 2024Updated last year
- Some basic tools for interacting with `tcf-agent`β11Jan 19, 2024Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Mar 30, 2021Updated 4 years ago
- β11Nov 5, 2021Updated 4 years ago
- DReAMy: a library for dream-reports annotation methods with python, NLP, and LLMsβ16Jun 6, 2024Updated last year
- A High-level Library for Named Entity Recognition in Python.β25Dec 7, 2023Updated 2 years ago
- Entry for JS13k 2021β10Sep 12, 2021Updated 4 years ago
- Pencil.js β€οΈ Vue - Build reactive 2D graphics scene in your Vue projectβ11Nov 19, 2020Updated 5 years ago
- PAVOQUE Corpus of Expressive Speechβ12Aug 2, 2016Updated 9 years ago
- steps to perform text-based speaker diarization with kaldi toolkitβ12Nov 2, 2018Updated 7 years ago
- Apertium linguistic data for Catalanβ11Mar 13, 2026Updated last week
- Portuguese voice2json profile based on Pocketsphinxβ11Jul 15, 2020Updated 5 years ago
- β10Nov 1, 2025Updated 4 months ago
- A list of cheatsheets for different stuff (based on many sources)β11Mar 29, 2016Updated 9 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)β10Jun 2, 2021Updated 4 years ago
- wav2rtp is a simple tool intended to convert speech data from wav files to RTP data streamβ14Aug 15, 2021Updated 4 years ago
- Docker container for UDPipe (https://github.com/ufal/udpipe) REST server.β12Jun 23, 2020Updated 5 years ago
- Experimentos com flaskβ11Jan 29, 2023Updated 3 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021β11Jun 13, 2021Updated 4 years ago
- Small projects using the OpenAI API.β13Mar 21, 2025Updated last year
- This repository contains the data and code created under the project NLP4Rare-cm-uc3m.β10Sep 14, 2021Updated 4 years ago
- Datasets of Neuropsychological Language Tests in Brazilian Portugueseβ13Oct 14, 2025Updated 5 months ago
- β11Aug 8, 2018Updated 7 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`β21Jan 8, 2026Updated 2 months ago
- β13Nov 16, 2022Updated 3 years ago
- Extract wav from pcap (rtp)β14Jul 17, 2018Updated 7 years ago
- Python library for GeneiaTaggerβ10May 7, 2015Updated 10 years ago
- Freeswitch Wikiβ11Apr 2, 2019Updated 6 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biomeβ¦β12Jan 1, 2023Updated 3 years ago
- β16Feb 9, 2024Updated 2 years ago
- β12Oct 12, 2023Updated 2 years ago
- vad algorithm based on esp32 for mute detectionβ13Dec 9, 2018Updated 7 years ago