JohnSnowLabs / spark-nlp
State of the Art Natural Language Processing
β3,895Updated this week
Alternatives and similar repositories for spark-nlp:
Users that are interested in spark-nlp are comparing it to the libraries listed below
- Public runnable examples of using John Snow Labs' NLP for Apache Spark.β1,047Updated this week
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,357Updated this week
- Top2Vec learns jointly embedded topic, document and word vectors.β2,973Updated 2 months ago
- NLP, before and after spaCyβ2,214Updated last year
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languagesβ7,337Updated this week
- Simple and Distributed Machine Learningβ5,092Updated last week
- β¨Fast Coreference Resolution in spaCy with Neural Networksβ2,866Updated last year
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of langβ¦β1,512Updated last month
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"β6,240Updated 3 months ago
- Learning embeddings for classification, retrieval and ranking.β3,952Updated 2 years ago
- Language-Agnostic SEntence Representationsβ3,608Updated 8 months ago
- MLeap: Deploy ML Pipelines to Productionβ1,504Updated last month
- π¦ Contextually-keyed word vectorsβ1,633Updated 10 months ago
- Natural Language Processing Best Practices & Examplesβ6,386Updated 2 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conveβ¦β4,137Updated 7 months ago
- A full spaCy pipeline and models for scientific/biomedical documents.β1,734Updated last month
- π₯ Fast State-of-the-Art Tokenizers optimized for Research and Productionβ9,258Updated this week
- State-of-the-Art Text Embeddingsβ15,772Updated last week
- A system for quickly generating training data with weak supervisionβ5,826Updated 8 months ago
- π©βπ« Advanced NLP with spaCy: A free online courseβ2,334Updated 2 months ago
- Beautiful visualizations of how language differs among document types.β2,272Updated 3 months ago
- Single-document unsupervised keyword extractionβ1,668Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)β1,105Updated 4 months ago
- Papers & presentation materials from Hugging Face's internal science dayβ2,041Updated 4 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understandingβ6,180Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP)β14,016Updated this week
- An open-source NLP research library, built on PyTorch.β11,782Updated 2 years ago
- Python Keyphrase Extraction moduleβ1,571Updated last year
- brat rapid annotation tool (brat) - for all your textual annotation needsβ1,837Updated 6 months ago
- Data augmentation for NLPβ4,491Updated 6 months ago