LowriWilliams / Topic_Modelling_Beyond_Tokens
Investigating how to extract meaningful topic names from textual data
☆20 Updated 4 years ago
Alternatives and similar repositories for Topic_Modelling_Beyond_Tokens:
Users who are interested in Topic_Modelling_Beyond_Tokens are comparing it to the repositories listed below
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do… ☆48 Updated 11 months ago
- Template for AC297r projects ☆33 Updated 5 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆26 Updated 2 years ago
- Package that returns a company embedding given a company name ☆45 Updated 4 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLP ☆17 Updated 4 years ago
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask … ☆46 Updated 8 years ago
- ☆54 Updated 3 years ago
- Text processing library for sentiment analysis and related tasks ☆27 Updated 6 years ago
- store my personal project ☆22 Updated 4 years ago
- Explainable Zero-Shot Topic Extraction ☆62 Updated 6 months ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier. ☆26 Updated 3 years ago
- ☆30 Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog. ☆42 Updated last year
- ☆54 Updated last year
- ☆16 Updated 4 years ago
- No Teacher BART distillation experiment for NLI tasks ☆27 Updated 4 years ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories ☆35 Updated 4 years ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Regex-like pattern tree matching, but on a sentence's tree instead of strings ☆42 Updated 6 years ago
- A comprehensive tool for linguistic analysis of communities ☆49 Updated 3 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ☆32 Updated 2 years ago
- An embeddable annotation tool for end-to-end cross-document coreference ☆41 Updated last year
- Training Temporal Word Embeddings with a Compass ☆64 Updated 2 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 Updated 3 years ago
- Automatic labeling for topic models ☆57 Updated 9 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python. ☆84 Updated 7 months ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation) ☆14 Updated 3 years ago
- Semantically distinct key phrase extraction using Hilbert hashes. ☆48 Updated 3 years ago
- ☆54 Updated 9 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆58 Updated 10 months ago