McGill-NLP / medal
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
☆247 Updated last year
Related projects
Alternatives and complementary repositories for medal
- A library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms ☆144 Updated last year
- Code for the EACL 2021 Paper: Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration ☆87 Updated 3 months ago
- ☆204 Updated 2 years ago
- We evaluate many models used for biomedical and clinical NLP tasks, and train new models that perform much better. ☆157 Updated 3 years ago
- Code for the emrQA question answering dataset ☆143 Updated 2 years ago
- ☆95 Updated 2 years ago
- A collection of papers on automated medical coding from free text ☆122 Updated last month
- A project for developing transformer-based models for clinical relation extraction ☆127 Updated last year
- Pre-trained Language Model for Biomedical Question Answering ☆122 Updated last year
- ☆38 Updated 2 years ago
- Implementation and demo of explainable coding of clinical notes with Hierarchical Label-wise Attention Networks (HLAN) ☆58 Updated 2 years ago
- Transformers for Clinical NLP ☆21 Updated this week
- Weakly supervised medical named entity classification ☆70 Updated last year
- ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission (CHIL 2020 Workshop) ☆381 Updated 2 years ago
- [NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking. ☆174 Updated last year
- ☆56 Updated last year
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA ☆34 Updated 9 months ago
- Tools for curating biomedical training data for large-scale language modeling ☆459 Updated 3 weeks ago
- Recognize biomedical entities from a text corpus ☆116 Updated last year
- A Python Natural Language Processing Toolkit for Medical Text Generation ☆70 Updated 3 weeks ago
- Library for clinical NLP with spaCy. ☆535 Updated 3 weeks ago
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary. ☆83 Updated 4 years ago
- SciFive: a text-text transformer model for biomedical literature ☆90 Updated 5 months ago
- Code repository for BEEP (Biomedical Evidence Enhanced Predictions) clinical outcome prediction system ☆24 Updated last year
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites ☆354 Updated last year
- BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora. ☆286 Updated 2 years ago
- ☆74 Updated 2 months ago
- A new collection of 1.7k doctor-patient conversations and corresponding clinical notes/summaries. ☆56 Updated last year
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III). ☆558 Updated last year
- MedType: Improving Medical Entity Linking with Semantic Type Prediction ☆116 Updated last year