salgadev / medical-nlp
Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.
☆93 Updated 4 years ago
Alternatives and similar repositories for medical-nlp
Users who are interested in medical-nlp are comparing it to the libraries listed below.
- A library for named entity recognition developed by the UF HOBI NLP lab, featuring SOTA algorithms ☆147 Updated last year
- A Python Natural Language Processing Toolkit for Medical Text Generation ☆79 Updated last week
- Recognize biomedical entities from a text corpus ☆120 Updated last year
- Sentence tokenizer for clinical/medical text. ☆26 Updated 11 months ago
- Implementation and demo of explainable coding of clinical notes with Hierarchical Label-wise Attention Networks (HLAN) ☆61 Updated 2 years ago
- Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medic… ☆271 Updated last year
- Transformers for Clinical NLP ☆24 Updated 2 weeks ago
- PLM-ICD: Automatic ICD Coding with Pretrained Language Models ☆66 Updated last year
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data. ☆16 Updated last year
- A project for developing transformer-based models for clinical relation extraction ☆127 Updated last year
- Clinical text summarization by adapting large language models ☆141 Updated 9 months ago
- Robust de-identification of medical notes using transformer architectures ☆52 Updated 2 years ago
- This repository contains the code used for distillation and fine-tuning of compact biomedical transformers that have been introduced in t… ☆18 Updated last year
- All scripts used in the GatorTron project ☆116 Updated last year
- Biomedical Named Entity Recognition and Normalization of Diseases, Chemicals and Genetic entity classes through the use of state-of-the… ☆115 Updated 3 years ago
- We evaluate many models used for biomedical and clinical NLP tasks, and train new models that perform much better. ☆160 Updated 3 years ago
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounter’s dialogue transcript and clinical note… ☆35 Updated last year
- Dataset for training machine learning models to automatically generate psychiatric case notes from doctor-patient conversations. ☆59 Updated 2 years ago
- The project was to build and release the first publicly available code evidence dataset called MDACE on a subset of the MIMIC-III clinica… ☆29 Updated last year
- A deidentifier / deidentification pipeline developed by Stanford and Penn as part of the MIDRC organization. ☆85 Updated 11 months ago
- PubMed PICO Element Detection Dataset ☆56 Updated 6 years ago
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41… ☆45 Updated last year
- A collection of papers on automated medical coding from free text ☆140 Updated 4 months ago
- Quantifying biases in BERT embeddings pretrained on MIMIC-III clinical notes ☆24 Updated 4 years ago
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA ☆37 Updated last year