jinhangjiang / morethansentiments
Python package to calculate Boilerplate and many other text quantified features
☆21Updated last year
Alternatives and similar repositories for morethansentiments:
Users that are interested in morethansentiments are comparing it to the libraries listed below
- ☆54Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 4 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- HDBSCAN Tuning for BERTopic Models☆42Updated last year
- Python package for text mining of time-series data☆68Updated last month
- Fuzzy Topic Models☆26Updated 9 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Updated 9 months ago
- A maximum-strength name parser for record linkage.☆36Updated 5 months ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆64Updated 2 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 10 months ago
- Fast, flexible name matching for large datasets☆70Updated last year
- A Python client for the GDELT 2.0 Doc API☆112Updated 10 months ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆21Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆78Updated 5 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- Blazing fast fuzzy text search for Python.☆42Updated last week
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆110Updated last week
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 4 years ago
- A demo of a data science project using Kedro☆16Updated 3 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆76Updated last year
- Powerful topic model visualization in Python☆108Updated last week
- Use sync mode Playwright interactively, inside a Jupyter notebook☆14Updated last month
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆73Updated 7 months ago
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆29Updated last year
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆141Updated this week
- Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆42Updated 5 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆21Updated 2 years ago
- Learning from Neighbors: Unsupervised Text Classification☆17Updated 2 years ago