JohnSnowLabs / spark-nlp
State of the Art Natural Language Processing
☆3,965Updated this week
Alternatives and similar repositories for spark-nlp
Users that are interested in spark-nlp are comparing it to the libraries listed below
Sorting:
- Public runnable examples of using John Snow Labs' NLP for Apache Spark.☆1,056Updated this week
- A system for quickly generating training data with weak supervision☆5,856Updated last year
- MLeap: Deploy ML Pipelines to Production☆1,515Updated 5 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,461Updated 2 weeks ago
- NLP, before and after spaCy☆2,225Updated last year
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,263Updated 2 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,383Updated 3 months ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,762Updated 10 months ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,877Updated 2 years ago
- 🦆 Contextually-keyed word vectors☆1,652Updated 3 weeks ago
- Open source platform for the machine learning lifecycle☆20,456Updated this week
- 1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.☆912Updated 3 months ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,544Updated last week
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,750Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,161Updated last week
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,036Updated 6 months ago
- Open source annotation tool for machine learning practitioners.☆9,968Updated 5 months ago
- Natural Language Processing Best Practices & Examples☆6,410Updated 2 years ago
- 🪐 End-to-end NLP workflows from prototype to production☆1,378Updated 7 months ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,415Updated 3 weeks ago
- Language-Agnostic SEntence Representations☆3,638Updated last year
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,508Updated 5 months ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,848Updated last month
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,151Updated this week
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production☆9,670Updated 3 weeks ago
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆6,981Updated 3 weeks ago
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,050Updated this week
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,770Updated 3 weeks ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,173Updated 9 months ago
- 👩🏫 Advanced NLP with spaCy: A free online course☆2,359Updated 3 months ago