linyuehzzz / semantic_address_matching
An implementation of our paper in IJGIS 'A deep learning architecture for semantic address matching'
☆37 Updated 5 years ago
Alternatives and similar repositories for semantic_address_matching:
Users who are interested in semantic_address_matching are comparing it to the repositories listed below
- Deep Learning for Semantic Text Matching ☆18 Updated 4 years ago
- 📖 Use Bi-Normal Separation to find document vectors, which are used to compute similarity for shorter sentences. ☆26 Updated 6 years ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task. ☆31 Updated 3 years ago
- Use ML-Annotate to label data for machine learning purposes ☆107 Updated 4 years ago
- Refer to paper "Embedding-based News Recommendation for Millions of Users" & "Article De-duplication Using Distributed Representations" p… ☆31 Updated last year
- Tutorial for Topic Modelling using PySpark and Spark NLP ☆16 Updated 4 years ago
- Extract city and country mentions from text like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex. ☆60 Updated this week
- An evaluation of word embeddings for classification ☆32 Updated 6 years ago
- Production Machine Learning Pipeline for Text Classification with fastText ☆32 Updated 3 years ago
- Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE. ☆77 Updated 6 years ago
- Fine-tune BERT to generate sentence embeddings for cosine similarity ☆69 Updated 5 years ago
- Rank-based Unsupervised Keyword Extraction via Metavertex Aggregation ☆98 Updated 3 months ago
- ☆17 Updated 2 years ago
- A simple search engine for Medium stories, built with Streamlit and Elasticsearch. ☆40 Updated 3 years ago
- A scikit-learn compatible estimator based on business rules, with an interactive dashboard included ☆28 Updated 3 years ago
- Fine-grained Sentiment Analysis of User Reviews from AI Challenger ☆11 Updated 5 years ago
- Keyphrase Extraction based on Scientific Text, SemEval 2017, Task 10 ☆108 Updated 2 years ago
- Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms ☆36 Updated 8 years ago
- Set of scripts to aid in the download of the GDELT data files from gdelt.utdallas.edu ☆16 Updated 10 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆53 Updated last year
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Experiments on how to use machine learning to rank a product catalog ☆84 Updated 7 years ago
- In this project, we need to find commercial products listed on Google that refer to the same entity on Amazon by comparing the si… ☆11 Updated 8 years ago
- The official tool for transforming doccano format into common dataset formats. ☆106 Updated last year
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers ☆159 Updated 4 years ago
- AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo) ☆137 Updated 2 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆136 Updated 7 months ago
- Models and evaluation framework for trending topics detection ☆34 Updated 8 months ago
- Mixing Dirichlet Topic Models and Word Embeddings to make lda2vec, from the paper https://arxiv.org/abs/1605.02019 ☆29 Updated 6 years ago