McGill-NLP / medal
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
☆266 Updated last year
Alternatives and similar repositories for medal:
Users who are interested in medal are comparing it to the repositories listed below
- A library for named entity recognition developed by the UF HOBI NLP lab, featuring SOTA algorithms ☆146 Updated last year
- ☆214 Updated 3 months ago
- A project for developing transformer-based models for clinical relation extraction ☆127 Updated last year
- Recognize bio-medical entities from a text corpus ☆119 Updated last year
- ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission (CHIL 2020 Workshop) ☆411 Updated 2 years ago
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites ☆378 Updated last year
- We evaluate many models used for biomedical and clinical NLP tasks, and train new models that perform much better. ☆158 Updated 3 years ago
- A collection of papers on automated medical coding from free-texts ☆136 Updated 3 months ago
- Pre-trained Language Model for Biomedical Question Answering ☆122 Updated 2 years ago
- BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora. ☆293 Updated 3 years ago
- Implementation and demo of explainable coding of clinical notes with Hierarchical Label-wise Attention Networks (HLAN) ☆59 Updated 2 years ago
- ☆97 Updated 3 years ago
- Library for clinical NLP with spaCy. ☆554 Updated 3 weeks ago
- A Python Natural Language Processing Toolkit for Medical Text Generation ☆76 Updated 5 months ago
- Code for the EACL 2021 Paper: Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration ☆93 Updated 7 months ago
- Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems ☆280 Updated last year
- Code for the emrQA question answering dataset ☆146 Updated 3 years ago
- All scripts used in the GatorTron project ☆114 Updated last year
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary. ☆90 Updated 4 years ago
- ☆59 Updated last year
- Tools for curating biomedical training data for large-scale language modeling ☆471 Updated 3 months ago
- CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022] ☆78 Updated 2 years ago
- A simple interface to inspect, improve and add concepts to biomedical NER+L -> MedCAT. ☆79 Updated this week
- Weakly supervised medical named entity classification ☆72 Updated 2 years ago
- Medical Text Mining and Information Extraction with spaCy ☆434 Updated 2 years ago
- ☆38 Updated 3 years ago
- A corpus of Biomedical papers annotated with mentions of UMLS entities. ☆322 Updated 3 years ago
- ICD-BERT: Multi-label Classification of ICD-10 Codes with BERT (CLEF 2019) ☆74 Updated 2 years ago
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III). ☆569 Updated 2 years ago
- ACL'2020: Biomedical Entity Representations with Synonym Marginalization ☆163 Updated last year