4OH4 / doc-similarity
Ranking documents using semantic similarity in Python
☆35Updated 4 years ago
Alternatives and similar repositories for doc-similarity:
Users that are interested in doc-similarity are comparing it to the libraries listed below
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆53Updated last year
- Package that returns a company embedding given a company name☆44Updated 4 years ago
- ☆35Updated 3 years ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- Named entity relevant project☆30Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Train a model to find the names of products in text☆35Updated 4 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- A Python implementation of a basic Knowledge Graph☆104Updated 3 years ago
- Perform Latent Dirichlet Allocation on scientific articles with Gensim☆15Updated 5 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆159Updated 4 years ago
- Model training tutorials for the Stanza Python NLP Library☆37Updated 2 years ago
- Named entity recognizer based on ELMo or BERT as feature extractor and CRF as final classifier☆80Updated last year
- A simple Flask API for named entity extraction using spaCy Model☆48Updated 5 years ago
- Notebooks for fine-tuning a BERT model and training a LSTM model for financial QA☆30Updated 4 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆62Updated last year
- Applying BERT to named entity recognition in English and Russian.☆162Updated 2 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 5 years ago
- store my personal project☆22Updated 4 years ago
- The official tool for transforming doccano format into common dataset formats.☆106Updated last year
- Explainable Zero-Shot Topic Extraction☆62Updated 6 months ago
- ☆16Updated last year
- semantically distinct key phrase extraction using hilbert hashes.☆48Updated 2 years ago
- Multi Text Classificaiton☆92Updated 5 years ago
- sequence tagging with spaCy and crfsuite☆19Updated last year
- Named-entity recognition (NER) (also known as entity identification, entity chunking and entity extraction) is a subtask of information e…☆29Updated 4 years ago
- ☆22Updated 3 years ago
- Code related to experimentation of different Text Data Augmentation Techniques☆14Updated 5 years ago
- N-gram Extraction Approaches (bigrams, trigrams)☆43Updated 6 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆97Updated last year