McGill-NLP / medal
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
☆259Updated last year
Alternatives and similar repositories for medal:
Users that are interested in medal are comparing it to the libraries listed below
- a library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms☆146Updated last year
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites☆372Updated last year
- A project for developing transformer-based models for clinical relation extraction☆127Updated last year
- ☆212Updated 2 months ago
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆158Updated 3 years ago
- Code for the emrQA question answering dataset☆144Updated 3 years ago
- ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission (CHIL 2020 Workshop)☆401Updated 2 years ago
- Tools for curating biomedical training data for large-scale language modeling☆467Updated 2 months ago
- Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems☆272Updated last year
- BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.☆290Updated 3 years ago
- Recognize bio-medical entities from a text corpus☆119Updated last year
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.☆89Updated 4 years ago
- Pre-trained Language Model for Biomedical Question Answering☆122Updated last year
- Code for the EACL 2021 Paper: Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration☆91Updated 6 months ago
- A collection of papers on automated medical coding from free-texts☆129Updated last month
- ☆96Updated 2 years ago
- ☆38Updated 2 years ago
- Weakly supervised medical named entity classification☆70Updated 2 years ago
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).☆566Updated last year
- all scripts used in gatortron project☆114Updated last year
- The project was to build and release the first publicly available code evidence dataset called MDACE on a subset of the MIMIC-III clinica…☆26Updated last year
- ICD-BERT: Multi-label Classification of ICD-10 Codes with BERT (CLEF 2019)☆74Updated last year
- NER and Relation Extraction from Electronic Health Records (EHR).☆85Updated 2 years ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆76Updated 3 months ago
- Implementation and demo of explainable coding of clinical notes with Hierarchical Label-wise Attention Networks (HLAN)☆59Updated 2 years ago
- Labeled dataset of similar and dissimilar medical question pairs created by Curai's doctors☆20Updated 4 years ago
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA☆35Updated last year
- Library for clinical NLP with spaCy.☆547Updated 2 months ago
- CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]☆79Updated 2 years ago
- System for Medical Concept Extraction and Linking☆394Updated 6 months ago