KeremZaman / semantic-shLinks
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
☆28Updated last year
Alternatives and similar repositories for semantic-sh
Users that are interested in semantic-sh are comparing it to the libraries listed below
Sorting:
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 4 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆74Updated 2 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆95Updated 2 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 3 years ago
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆47Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Updated 11 months ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- Query and visualize knowledge graphs☆62Updated 9 months ago
- Various Jupyter notebooks about Common Crawl data☆61Updated last month
- ☆16Updated 2 years ago
- Python text processing, pattern matching, and NLP framework☆66Updated 2 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆29Updated 4 years ago
- ☆13Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆76Updated 2 months ago
- Creating class-based TF-IDF matrices☆91Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- Rank-based Unsupervised Keyword Extraction via Metavertex Aggregation☆99Updated last year
- An open source python library for automated prediction engineering☆45Updated 6 months ago
- simple rule based named entity recognition☆42Updated 3 years ago
- BERT, LDA, and TFIDF based keyword extraction in Python☆76Updated last week
- Implementation of SiameseXML (ICML 2021)☆40Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- COVID-19 Open Research Dataset (CORD-19) Analysis☆57Updated 3 years ago
- Efficient BM25 with DuckDB 🦆☆59Updated last year
- Extremely simple and fast extreme multi-class and multi-label classifiers.☆70Updated last month
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆53Updated last year
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆22Updated 5 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 4 years ago