chuanconggao / TopSim
Efficiently search the most similar strings against the query in Python.
☆18Updated 6 years ago
Alternatives and similar repositories for TopSim:
Users that are interested in TopSim are comparing it to the libraries listed below
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 7 years ago
- tag doc using topN words with lda☆10Updated 9 years ago
- Neutral Network based Chinese Segment System☆18Updated 8 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 7 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- ☆15Updated 3 years ago
- Repository w/ Jupyter + R Notebooks for creating a model to predict the success of Reddit submissions with Keras.☆28Updated 7 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- A simple model for classifying papers by academic venue (AI/ML/ACL), given a title and abstract. Bare-metal PyTorch port of https://gith…☆12Updated 6 years ago
- Chinese word segmentation algorithm based on entropy(基于熵,无需语料库的中文分词)☆11Updated 7 years ago
- Introduction to structured prediction with Python and pystruct☆18Updated 6 years ago
- Tools and services for evaluating topic models☆15Updated 8 years ago
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- NYAN is a news filtering engine written in Python and some Ruby.☆15Updated last year
- Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- Easy to follow text classifying implementation using a Conv. Neural Network (Tensorflow)☆14Updated 7 years ago
- Facilitate the learning, practicing, and designing of neural text matching models with a user-friendly and interactive interface.☆38Updated 2 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 9 years ago
- 为给定的一段文本抽取一个或多个基于知识树的标签。☆8Updated 9 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.☆10Updated 6 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- SQLFlow client library for Python☆29Updated 2 years ago