McGill-NLP / medalView external linksLinks
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
☆284Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for medal
Users that are interested in medal are comparing it to the libraries listed below
Sorting:
- init☆13Dec 4, 2024Updated last year
- A corpus of Biomedical papers annotated with mentions of UMLS entities.☆342Nov 9, 2021Updated 4 years ago
- Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems☆313Oct 17, 2023Updated 2 years ago
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).☆588Mar 25, 2023Updated 2 years ago
- ☆63Jul 4, 2023Updated 2 years ago
- Dataset containing 7,025 discharge summary notes from the MIMIC III dataset annotated for 7 SBDHs☆18Jun 18, 2022Updated 3 years ago
- Medical Concept Annotation Tool☆522Jul 25, 2025Updated 6 months ago
- a library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms☆155Sep 13, 2023Updated 2 years ago
- A rule-based Python module for spitting documents into sections.☆12Nov 14, 2020Updated 5 years ago
- Code for the emrQA question answering dataset☆152Feb 9, 2022Updated 4 years ago
- New approach to use Snomed-CT Concept using Word Embedding with Word2vec☆21Feb 27, 2019Updated 6 years ago
- ☆221Dec 11, 2024Updated last year
- BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.☆296Jan 12, 2022Updated 4 years ago
- Library for clinical NLP with spaCy.☆632Aug 4, 2025Updated 6 months ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Sep 22, 2025Updated 4 months ago
- ☆101Feb 25, 2022Updated 3 years ago
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆163Jul 29, 2021Updated 4 years ago
- Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Fi…☆17Jul 5, 2024Updated last year
- [in development] Tools to support Natural Language Processing of freetext to create structured data elements for analysis☆34Jan 2, 2024Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆379Apr 21, 2023Updated 2 years ago
- ☆38Mar 27, 2022Updated 3 years ago
- This repository contains the code used for distillation and fine-tuning of compact biomedical transformers that have been introduced in t…☆19Mar 26, 2024Updated last year
- System for Medical Concept Extraction and Linking☆432Aug 12, 2024Updated last year
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!☆11Jun 12, 2023Updated 2 years ago
- Biomedical concept relatedness benchmark sampled from electronic health records☆11Jul 14, 2022Updated 3 years ago
- Code for the paper "Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset" (ACL 2020)☆17May 9, 2020Updated 5 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- ☆66Feb 27, 2021Updated 4 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆142Nov 17, 2025Updated 2 months ago
- Python package for machine learning for healthcare using a OMOP common data model☆111Jun 17, 2023Updated 2 years ago
- MedTagger is a light weight clinical NLP system built upon Apache UIMA.☆71May 5, 2025Updated 9 months ago
- A comprehensive NLP preprocessing package for clinical notes sentence boundary detection, tokenization☆32May 22, 2024Updated last year
- Code repository for BEEP (Biomedical Evidence Enhanced Predictions) clinical outcome prediction system☆26Nov 8, 2023Updated 2 years ago
- EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?☆57Mar 9, 2023Updated 2 years ago
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites☆439Oct 17, 2023Updated 2 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆199Oct 3, 2023Updated 2 years ago
- OMOP <-> FHIR mapper☆11Mar 6, 2023Updated 2 years ago
- A collection of papers on automated medical coding from free-texts☆153Dec 21, 2024Updated last year