McGill-NLP / medal
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
☆270Updated last year
Alternatives and similar repositories for medal:
Users that are interested in medal are comparing it to the libraries listed below
- ☆217Updated 5 months ago
- a library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms☆147Updated last year
- Medical Concept Annotation Tool☆487Updated last week
- Recognize bio-medical entities from a text corpus☆120Updated last year
- A project for developing transformer-based models for clinical relation extraction☆127Updated last year
- Tools for curating biomedical training data for large-scale language modeling☆477Updated 5 months ago
- Weakly supervised medical named entity classification☆73Updated 2 years ago
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆160Updated 3 years ago
- ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission (CHIL 2020 Workshop)☆415Updated 2 years ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆78Updated this week
- A collection of papers on automated medical coding from free-texts☆139Updated 4 months ago
- ☆98Updated 3 years ago
- Code for the EACL 2021 Paper: Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration☆95Updated 9 months ago
- Code for the emrQA question answering dataset☆148Updated 3 years ago
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).☆571Updated 2 years ago
- BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.☆294Updated 3 years ago
- A corpus of Biomedical papers annotated with mentions of UMLS entities.☆326Updated 3 years ago
- System for Medical Concept Extraction and Linking☆402Updated 8 months ago
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites☆387Updated last year
- The project was to build and release the first publicly available code evidence dataset called MDACE on a subset of the MIMIC-III clinica…☆28Updated last year
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.☆93Updated 4 years ago
- Pre-trained Language Model for Biomedical Question Answering☆122Updated 2 years ago
- A simple interface to inspect, improve and add concepts to biomedical NER+L -> MedCAT.☆80Updated last week
- Library for converting clinical trial eligibility criteria to a machine-readable format.☆171Updated 3 years ago
- ☆59Updated last year
- Implementation and demo of explainable coding of clinical notes with Hierarchical Label-wise Attention Networks (HLAN)☆61Updated 2 years ago
- ACL'2020: Biomedical Entity Representations with Synonym Marginalization☆164Updated last year
- ☆38Updated 3 years ago
- SciFive: a text-text transformer model for biomedical literature☆94Updated 11 months ago
- [NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.☆187Updated 2 years ago