Anonymization Pipeline for injesting data from outside of BSC that contains GDPR protected data.
☆17Nov 10, 2023Updated 2 years ago
Alternatives and similar repositories for AnonymizationPipeline
Users that are interested in AnonymizationPipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A light-weighted UMLS-based data augmentation for biomedical NLP tasks including Named Entity Recognition and sentence classification.☆10Apr 6, 2021Updated 5 years ago
- Pre-production releases for Spacy in Catalan☆14Nov 30, 2021Updated 4 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP☆17Sep 23, 2024Updated last year
- ☆21May 1, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An active annotation tool based on brat(https://github.com/nlplab/brat)☆19Aug 22, 2017Updated 8 years ago
- ☆18Sep 24, 2024Updated last year
- The code to perform Sequence Labelling with LLMs, including T5, FLAN, LLaMA, Alpaca and more!☆14Nov 5, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- DReAMy: a library for dream-reports annotation methods with python, NLP, and LLMs☆16Jun 6, 2024Updated last year
- ☆31Jun 12, 2023Updated 2 years ago
- PAVOQUE Corpus of Expressive Speech☆12Aug 2, 2016Updated 9 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Nov 1, 2025Updated 6 months ago
- A list of cheatsheets for different stuff (based on many sources)☆11Mar 29, 2016Updated 10 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)☆10Jun 2, 2021Updated 4 years ago
- wav2rtp is a simple tool intended to convert speech data from wav files to RTP data stream☆14Aug 15, 2021Updated 4 years ago
- Docker container for UDPipe (https://github.com/ufal/udpipe) REST server.☆12Jun 23, 2020Updated 5 years ago
- Extrator de entidades mencionadas em notícias da mídia☆15May 25, 2021Updated 4 years ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- Experimentos com flask☆11Jan 29, 2023Updated 3 years ago
- Small projects using the OpenAI API.☆13Mar 21, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆14Oct 14, 2025Updated 7 months ago
- Yet another heatmap generator for rtl_power csv file☆11Aug 24, 2025Updated 9 months ago
- ☆11Aug 8, 2018Updated 7 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆23Apr 2, 2026Updated last month
- ☆13Nov 16, 2022Updated 3 years ago
- Python library for GeneiaTagger☆10May 7, 2015Updated 11 years ago
- Freeswitch Wiki☆11Apr 2, 2019Updated 7 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome…☆12Jan 1, 2023Updated 3 years ago
- Hands-On Machine Learning using JavaScript, published by Packt☆13Jan 1, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Feb 9, 2024Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated last month
- vad algorithm based on esp32 for mute detection☆13Dec 9, 2018Updated 7 years ago
- ✖️MEN - A Modular Toolkit for Cross-Lingual Medical Entity Normalization☆32Dec 28, 2024Updated last year
- Converts brat standoff format to JSONL format☆13Jan 29, 2022Updated 4 years ago
- Bilingual word translation straight from Linguee website☆16Feb 10, 2019Updated 7 years ago