maobedkova / TopicModelling_PySpark_SparkNLP
Tutorial for Topic Modelling using PySpark and Spark NLP
☆17Updated 4 years ago
Alternatives and similar repositories for TopicModelling_PySpark_SparkNLP:
Users that are interested in TopicModelling_PySpark_SparkNLP are comparing it to the libraries listed below
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 8 months ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- ☆16Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- ☆35Updated 3 years ago
- Template for AC297r projects☆33Updated 5 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated last year
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- No Teacher BART distillation experiment for NLI tasks☆26Updated 4 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆34Updated 3 years ago
- ☆30Updated 2 years ago
- The official tool for transforming doccano format into common dataset formats.☆106Updated 2 years ago
- ☆16Updated 4 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆71Updated last year
- ☆43Updated 2 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- Generating Training Data Made Easy☆43Updated 4 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated last year
- ☆54Updated last year
- ☆13Updated 3 years ago
- Package that returns a company embedding given a company name☆45Updated 4 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆66Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year