NIHOPA/word2vec_pipeline

NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)

☆116

Alternatives and similar repositories for word2vec_pipeline

Users who are interested in word2vec_pipeline compare it to the libraries listed below.

NIHOPA / NLPre
Python library for Natural Language Preprocessing (NLPre)
☆190 · Jul 31, 2023 · Updated 2 years ago

AVBelyy / Rysearch
Exploratory search engine based on hierarchical topic models from BigARTM
☆13 · Mar 8, 2022 · Updated 4 years ago

arne-cl / ppi_graphkernel
all-paths graph kernel for protein-protein interaction extraction
☆12 · Apr 22, 2014 · Updated 12 years ago

uhh-lt / sensegram
Making sense embedding out of word embeddings using graph-based word sense induction
☆214 · May 17, 2021 · Updated 5 years ago

ecohealthalliance / EpiTator
EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…
☆43 · Jun 21, 2022 · Updated 4 years ago

MGH-LMIC / graynet_keras
Pretrained parameters for CT deep learning models.
☆13 · Sep 24, 2019 · Updated 6 years ago

bobye / acl2017_document_clustering
code for "Determining Gains Acquired from Word Embedding Quantitatively Using Discrete Distribution Clustering" ACL 2017
☆21 · Nov 21, 2018 · Updated 7 years ago

stephenhky / PyShortTextCategorization
Various Algorithms for Short Text Mining
☆471 · Updated this week

greenelab / snorkeling
Extracting biomedical relationships from literature with Snorkel 🏊
☆58 · Feb 1, 2021 · Updated 5 years ago

o19s / lazy-semantic-indexing
Elasticsearch Latent Semantic Indexing experimentation
☆32 · Oct 18, 2019 · Updated 6 years ago

thoppe / Federal-AI-inventory-analysis-2023
Analysis of the projects reported on the Federal inventory for EO 13960
☆20 · Aug 19, 2025 · Updated 11 months ago

JasonKessler / scattertext
Beautiful visualizations of how language differs among document types.
☆2,337 · Jul 4, 2026 · Updated 3 weeks ago

RaRe-Technologies / talks
Presentations & notebooks from our talks/workshops/meetups/etc
☆24 · Mar 23, 2018 · Updated 8 years ago

alphagov / govuk-taxonomy-supervised-learning
Auto-tag govuk content to the collated legacy taxonomies
☆21 · Sep 16, 2021 · Updated 4 years ago

titipata / grant_database
Downloader, preprocessor, parser and deduper for NIH and NSF grants
☆22 · Aug 24, 2018 · Updated 7 years ago

procurement-analytics / procurement-analytics
A dashboard with insights into Mexico's procurement performance
☆12 · Jul 17, 2020 · Updated 6 years ago

allenai / taggers
Easily identify and label sentence intervals using various taggers.
☆16 · Feb 1, 2017 · Updated 9 years ago

ajschumacher / dc_voter_reg
snapshot of DC voter registration data
☆16 · Dec 21, 2014 · Updated 11 years ago

wroberts / fsed
Aho-Corasick string replacement utility
☆26 · Nov 25, 2019 · Updated 6 years ago

Vachik-Dave / Neural-Brane-Neural-Bayesian-Personalized-Ranking-for-Attributed-Network-Embedding
☆13 · Aug 13, 2018 · Updated 7 years ago

vered1986 / OKR
OKR: A Consolidated Open Knowledge Representation for Multiple Texts
☆41 · Jan 25, 2018 · Updated 8 years ago

rock3125 / sentence2vec
Sentence2vec by Rock
☆311 · Mar 30, 2025 · Updated last year

ehsansherkat / ConVec
In this project, we use skip-gram model to embed Wikipedia Concepts and Entities. The English version of Wikipedia contains more than fiv…
☆57 · Nov 12, 2017 · Updated 8 years ago

thoppe / RNN_science_titles
Do you even science, bro? Using RNN's to predict scientific titles.
☆14 · Jun 5, 2017 · Updated 9 years ago

luffycodes / attention-word-embedding
Code for Attention Word Embeddings
☆20 · Oct 31, 2020 · Updated 5 years ago

arpit3043 / Extractive-Text-Summerization
Summarization systems often have additional evidence they can utilize in order to specify the most important topics of document(s). For e…
☆22 · Sep 1, 2022 · Updated 3 years ago

openrif / vivo-isf-ontology
The "VIVO-ISF Ontology" is an OWL2 representation of the VIVO-ISF Data Standard
☆18 · Mar 13, 2019 · Updated 7 years ago

Rostlab / nalaf
NLP framework in python for entity recognition and relationship extraction
☆115 · Dec 8, 2022 · Updated 3 years ago

giacbrd / ShallowLearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some addit…
☆198 · Aug 8, 2017 · Updated 8 years ago

ai-ku / wvec
Word vectors
☆63 · May 26, 2018 · Updated 8 years ago

tukeyclothespin / scimitar
Arabic Text Detection in Images
☆15 · Apr 5, 2018 · Updated 8 years ago

chartbeat-labs / textacy
NLP, before and after spaCy
☆2,239 · Sep 22, 2023 · Updated 2 years ago

agussman / hrc-email
Tools for analyzing the Hillary Clinton emails
☆13 · Apr 24, 2016 · Updated 10 years ago

rockt / SETH
SNP Extraction Tool for Human Variations
☆27 · Feb 21, 2024 · Updated 2 years ago

lisc-tools / lisc
Literature Scanner: Automated collection & analyses of the scientific literature.
☆111 · Jun 10, 2026 · Updated last month

maciejkula / binge
Recommendation models that use binary rather than floating point operations at prediction time.
☆21 · Sep 18, 2017 · Updated 8 years ago

martbert / decomp_attn_keras
Parikh et al., A Decomposable Attention Model for Natural Inference
☆17 · Feb 12, 2018 · Updated 8 years ago

jakelever / kindred
A Python biomedical relation extraction package that uses a supervised approach (i.e. needs training data).
☆157 · Mar 12, 2023 · Updated 3 years ago

MaxHalford / myriade
✨🌲 Hierarchical extreme multiclass and multi-label classification.
☆18 · Jan 5, 2023 · Updated 3 years ago