allenai/scispacy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/scispacy)

allenai / scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

☆1,977

Alternatives and similar repositories for scispacy

Users that are interested in scispacy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

medspacy / medspacy
View on GitHub
Library for clinical NLP with spaCy.
☆670Jun 4, 2026Updated last month
allenai / scibert
View on GitHub
A BERT model for scientific text.
☆1,705Feb 22, 2022Updated 4 years ago
Georgetown-IR-Lab / QuickUMLS
View on GitHub
System for Medical Concept Extraction and Linking
☆449Aug 12, 2024Updated last year
jenojp / negspacy
View on GitHub
spaCy pipeline object for negating concepts in text
☆280Apr 20, 2026Updated 3 months ago
chanzuckerberg / MedMentions
View on GitHub
A corpus of Biomedical papers annotated with mentions of UMLS entities.
☆346Nov 9, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dmis-lab / biobert
View on GitHub
Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining
☆2,200Aug 13, 2023Updated 2 years ago
NLPatVCU / medaCy
View on GitHub
Medical Text Mining and Information Extraction with spaCy
☆438Nov 1, 2022Updated 3 years ago
kormilitzin / med7
View on GitHub
☆226Dec 11, 2024Updated last year
ncbi-nlp / bluebert
View on GitHub
BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
☆597Mar 25, 2023Updated 3 years ago
ncbi-nlp / BioSentVec
View on GitHub
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
☆615Aug 15, 2023Updated 2 years ago
explosion / spacy-transformers
View on GitHub
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
☆1,408Mar 27, 2026Updated 3 months ago
EmilyAlsentzer / clinicalBERT
View on GitHub
repository for Publicly Available Clinical BERT Embeddings
☆772Aug 25, 2020Updated 5 years ago
CogStack / MedCAT
View on GitHub
Medical Concept Annotation Tool
☆531Jul 25, 2025Updated 11 months ago
explosion / projects
View on GitHub
🪐 End-to-end NLP workflows from prototype to production
☆1,432Oct 15, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
naver / biobert-pretrained
View on GitHub
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
☆706Jun 2, 2020Updated 6 years ago
ncbi-nlp / BLUE_Benchmark
View on GitHub
BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.
☆298Jan 12, 2022Updated 4 years ago
dmis-lab / bern
View on GitHub
A neural named entity recognition and multi-type normalization tool for biomedical text mining
☆177Apr 18, 2022Updated 4 years ago
gmichalo / UmlsBERT
View on GitHub
☆101Feb 25, 2022Updated 4 years ago
chartbeat-labs / textacy
View on GitHub
NLP, before and after spaCy
☆2,239Sep 22, 2023Updated 2 years ago
huggingface / neuralcoref
View on GitHub
✨Fast Coreference Resolution in spaCy with Neural Networks
☆2,892Apr 13, 2023Updated 3 years ago
jakelever / kindred
View on GitHub
A Python biomedical relation extraction package that uses a supervised approach (i.e. needs training data).
☆157Mar 12, 2023Updated 3 years ago
allenai / allennlp
View on GitHub
An open-source NLP research library, built on PyTorch.
☆11,888Nov 22, 2022Updated 3 years ago
LHNCBC / metamaplite
View on GitHub
A near real-time named-entity recognizer
☆66May 20, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BaderLab / saber
View on GitHub
Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work …
☆102Jul 14, 2020Updated 6 years ago
allenai / s2orc
View on GitHub
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
☆1,073Apr 26, 2024Updated 2 years ago
flairNLP / flair
View on GitHub
A very simple framework for state-of-the-art Natural Language Processing (NLP)
☆14,382Oct 27, 2025Updated 8 months ago
NIHOPA / NLPre
View on GitHub
Python library for Natural Language Preprocessing (NLPre)
☆190Jul 31, 2023Updated 2 years ago
dmis-lab / BioSyn
View on GitHub
ACL'2020: Biomedical Entity Representations with Synonym Marginalization
☆184Jul 7, 2023Updated 3 years ago
svjan5 / medtype
View on GitHub
MedType: Improving Medical Entity Linking with Semantic Type Prediction
☆114Feb 10, 2023Updated 3 years ago
explosion / spacy-stanza
View on GitHub
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
☆747Aug 15, 2024Updated last year
explosion / spaCy
View on GitHub
💫 Industrial-strength Natural Language Processing (NLP) in Python
☆33,768May 19, 2026Updated 2 months ago
allenai / specter
View on GitHub
SPECTER: Document-level Representation Learning using Citation-informed Transformers
☆586Jun 12, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
NorskRegnesentral / skweak
View on GitHub
skweak: A software toolkit for weak supervision applied to NLP tasks
☆925Sep 2, 2024Updated last year
allenai / science-parse
View on GitHub
Science Parse parses scientific papers (in PDF form) and returns them in structured form.
☆701May 26, 2024Updated 2 years ago
explosion / spacy-streamlit
View on GitHub
👑 spaCy building blocks and visualizers for Streamlit apps
☆859Jul 29, 2024Updated last year
ICLRandD / Blackstone
View on GitHub
A spaCy pipeline and model for NLP on unstructured legal text.
☆692Jul 16, 2024Updated 2 years ago
kdpsingh / clinspacy
View on GitHub
Clinical Natural Language Processing using spaCy, scispacy, and medspacy
☆102Apr 24, 2024Updated 2 years ago
snorkel-team / snorkel
View on GitHub
A system for quickly generating training data with weak supervision
☆5,994Jun 8, 2026Updated last month
allenai / scidocs
View on GitHub
Dataset accompanying the SPECTER model
☆148Dec 19, 2022Updated 3 years ago