explosion / projects
🪐 End-to-end NLP workflows from prototype to production
★1,405, updated last year
Alternatives and similar repositories for projects
Users who are interested in projects are comparing it to the libraries listed below.
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy (★1,402, updated last month)
- 🍳 Recipes for Prodigy, our fully scriptable annotation tool (★502, updated last year)
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy (★742, updated last year)
- skweak: A software toolkit for weak supervision applied to NLP tasks (★926, updated last year)
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering. (★1,753, updated last year)
- 1 line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems. (★947, updated 10 months ago)
- Named Entity Recognition (NER) annotation tool for SpaCy. Generates training data as JSON which can be readily used. (★591, updated 9 months ago)
- NLP, before and after spaCy (★2,233, updated 2 years ago)
- BookNLP, a natural language processing pipeline for books (★881, updated last year)
- A spaCy pipeline and model for NLP on unstructured legal text. (★667, updated last year)
- 👑 spaCy building blocks and visualizers for Streamlit apps (★849, updated last year)
- ktrain is a Python library that makes deep learning and AI more accessible and easier to apply (★1,264, updated 10 months ago)
- 🦆 Contextually-keyed word vectors (★1,666, updated 7 months ago)
- SpikeX - SpaCy Pipes for Knowledge Extraction (★400, updated 4 years ago)
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang… (★1,555, updated 6 months ago)
- 🧹 Python package for text cleaning (★998, updated 2 years ago)
- Information extraction from English and German texts based on predicate logic (★393, updated 3 years ago)
- Compute Sentence Embeddings Fast! (★624, updated 2 years ago)
- A full spaCy pipeline and models for scientific/biomedical documents. (★1,906, updated last week)
- Top2Vec learns jointly embedded topic, document and word vectors. (★3,106, updated last year)
- Fuzzy string matching, grouping, and evaluation. (★786, updated 5 months ago)
- A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… (★1,253, updated 4 months ago)
- LexNLP by LexPredict (★757, updated last year)
- Single-document unsupervised keyword extraction (★1,801, updated last week)
- Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing (★561, updated last year)
- Beautiful visualizations of how language differs among document types. (★2,327, updated 7 months ago)
- A Python library for calculating a large variety of metrics from text (★358, updated 11 months ago)
- Fixes contractions such as `you're` to `you are` (★318, updated 3 years ago)
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking! (★477, updated 2 years ago)
- OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at EACL2021 demo track) (★793, updated 2 weeks ago)