KeremZaman / semantic-sh
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
☆26Updated 5 months ago
Alternatives and similar repositories for semantic-sh:
Users that are interested in semantic-sh are comparing it to the libraries listed below
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆45Updated last year
- Just another sentiment wrapper.☆17Updated 3 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated last year
- Metadata store for Production ML☆89Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 8 months ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- ☆69Updated 3 years ago
- ☆17Updated 3 years ago
- MinHash implementation in Python☆11Updated 4 months ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 10 months ago
- Word2Vec encodings based search engine for Stackoverflow questions☆26Updated last year
- A few-shot learning method based on siamese networks.☆28Updated last year
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 5 months ago
- 🎛 Distributed machine learning made simple.☆49Updated last year
- A curated list of ML awesome frameworks & libraries for text data☆16Updated last year
- ☆42Updated last year
- Generating Training Data Made Easy☆43Updated 4 years ago
- Automatic Text Summarization and Title Generation.☆25Updated 3 years ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆52Updated last year
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running.☆35Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last week
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- ☆20Updated 2 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- Ngrams with Basic Smoothings☆19Updated 8 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆47Updated 6 months ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago