ttavni / 2D_Text_ClusteringLinks
Using word embeddings, TFIDF and text-hashing to cluster and visualise text documents
☆15Updated 6 years ago
Alternatives and similar repositories for 2D_Text_Clustering
Users that are interested in 2D_Text_Clustering are comparing it to the libraries listed below
Sorting:
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…☆561Updated last year
- SpikeX - SpaCy Pipes for Knowledge Extraction☆402Updated 4 years ago
- Smarter Manual Annotation for Resource-constrained collection of Training data☆230Updated last year
- Train Spacy ner with custom dataset☆182Updated 3 years ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆318Updated last year
- 🏖 Easy training and deployment of seq2seq models.☆228Updated 4 years ago
- Deep learning with text doesn't have to be scary.☆274Updated 3 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- Use ML-Annotate to label data for machine learning purposes☆110Updated 5 years ago
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.☆201Updated last year
- Build a deep learning model for predicting the named entities from text.☆55Updated 7 years ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- NLP French language model implementing ULMFiT☆87Updated 6 years ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆474Updated 3 years ago
- OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning ex…☆53Updated 2 years ago
- A clean and easy interface for performing nearest-neighbor lookups☆50Updated 6 years ago
- Textpipe: clean and extract metadata from text☆302Updated 4 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆234Updated 2 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- Tutorial for Topic Modelling using PySpark and Spark NLP☆17Updated 5 years ago
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆504Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 5 years ago
- Production Machine Learning Pipeline for Text Classification with fastText☆33Updated 4 years ago
- A Python library for Interpretable Machine Learning in Text Classification using the SS3 model, with easy-to-use visualization tools for …☆349Updated 3 months ago
- Fuzzy matching and more functionality for spaCy.☆258Updated last year
- Repository for Project Insight: NLP as a Service☆319Updated 2 years ago
- Fixes contractions such as `you're` to `you are`☆320Updated 3 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆243Updated last year
- Storage and retrieval of Word Embeddings in various databases☆51Updated 7 years ago