KeremZaman / semantic-sh
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
☆26Updated 9 months ago
Alternatives and similar repositories for semantic-sh
Users that are interested in semantic-sh are comparing it to the libraries listed below
Sorting:
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆45Updated 2 years ago
- Turkish-Sentence Encoder with Quick-Thought Vectors☆11Updated 5 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 3 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI☆11Updated 3 years ago
- Extract dates from text☆64Updated 4 years ago
- Visualizing ELMo Contextual Vectors for Word Sense Disambiguation☆15Updated 4 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆16Updated last month
- Model for learning document embeddings along with their uncertainties☆35Updated last year
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 4 months ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- ☆15Updated 4 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- NLP tool to extract emotional phrase from tweets 🤩☆40Updated 3 years ago
- MinHash implementation in Python☆11Updated 8 months ago
- Topic Inference with Zeroshot models☆61Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Instructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through use…☆27Updated 2 years ago
- doccano auto labeling pipeline helps doccano to annotate a document automatically.☆42Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- ☆16Updated 4 years ago
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running.☆35Updated 4 years ago