KeremZaman / semantic-shLinks
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
☆28Updated last year
Alternatives and similar repositories for semantic-sh
Users that are interested in semantic-sh are comparing it to the libraries listed below
Sorting:
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆46Updated 2 years ago
- A curated list of awesome data annotation tools☆214Updated 2 years ago
- Python text processing, pattern matching, and NLP framework☆66Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆63Updated 7 months ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 4 years ago
- Use ML-Annotate to label data for machine learning purposes☆111Updated 5 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆51Updated 2 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆95Updated 2 years ago
- Query and visualize knowledge graphs☆59Updated 5 months ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- Knowledge Base Embedding By Cooperative Knowledge Distillation☆67Updated 2 years ago
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI☆11Updated 3 years ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆53Updated last year
- Graph databases, Knowledge Graphs, SPARQ☆80Updated 4 years ago
- simple rule based named entity recognition☆42Updated 3 years ago
- Social Media Mining Toolkit (SMMT) main repository☆137Updated 2 years ago
- Automatic machine learning for tabular data. ⚡🔥⚡☆70Updated 3 years ago
- 🖍️ Highlight text in documents☆109Updated 4 months ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 4 years ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated last year
- Creating class-based TF-IDF matrices☆90Updated 2 years ago
- BERT, LDA, and TFIDF based keyword extraction in Python☆74Updated last year
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- ☆13Updated 3 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 3 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Updated 4 years ago