maobedkova / TopicModelling_PySpark_SparkNLP
Tutorial for Topic Modelling using PySpark and Spark NLP
☆16Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for TopicModelling_PySpark_SparkNLP
- ☆16Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆51Updated 10 months ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆31Updated 3 years ago
- 🚀GUI for training spaCy models☆53Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆61Updated 3 months ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- Template for AC297r projects☆33Updated 4 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆60Updated last year
- ☆13Updated 3 years ago
- Production Machine Learning Pipeline for Text Classification with fastText☆32Updated 3 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 6 months ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- An evaluation of word-embeddings for classification☆33Updated 5 years ago
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.☆16Updated 4 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆37Updated last year
- sequence tagging with spaCy and crfsuite☆18Updated last year
- ☆15Updated 3 years ago
- Text Similarity Search Application using Modern NLP and Elasticsearch☆30Updated 4 years ago
- Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019☆29Updated 5 years ago
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆48Updated 2 years ago
- ☆29Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- ☆23Updated 4 years ago