KeremZaman / semantic-shLinks
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
☆28Updated 11 months ago
Alternatives and similar repositories for semantic-sh
Users that are interested in semantic-sh are comparing it to the libraries listed below
Sorting:
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated last year
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Query and visualize knowledge graphs☆59Updated 3 months ago
- ☆16Updated 4 years ago
- Instance Neighbouring by using Knowledge☆16Updated 9 months ago
- Multiplex: visualizations that tell stories—A Python library to create and annotate beautiful network graph visualizations, text visualiz…☆112Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆15Updated last year
- semantically distinct key phrase extraction using hilbert hashes.☆49Updated 3 years ago
- MinHash implementation in Python☆11Updated 10 months ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆52Updated last year
- LEMON: Explainable Entity Matching☆18Updated 3 years ago
- An open source python library for automated prediction engineering☆45Updated 3 weeks ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Updated 3 years ago
- Word embeddings for job postings☆13Updated 2 years ago
- How to build a multi-label sentiment classifiers with Tez and PyTorch☆19Updated 4 years ago
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆45Updated 2 years ago
- Blazing fast fuzzy text search for Python.☆45Updated 2 months ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 3 years ago
- Python package for utilizing TigerGraph Databases☆32Updated this week
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running.☆35Updated 4 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Deep Learning how-to's using Lance file format☆19Updated last month
- BERT, LDA, and TFIDF based keyword extraction in Python☆73Updated last year
- Train and use generative text models in a few lines of code.☆20Updated 3 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆63Updated 5 months ago
- Knowledge Base Embedding By Cooperative Knowledge Distillation☆67Updated 2 years ago
- Knowledge pills on Neural Search☆26Updated 2 years ago
- Efficient BM25 with DuckDB 🦆☆51Updated 6 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year