chuanconggao / TopSim
Efficiently search for the strings most similar to a query, in Python.
☆18Updated last month
Alternatives and similar repositories for TopSim
Users who are interested in TopSim are comparing it to the libraries listed below.
- Practical Natural Language Processing Tools for Humans is built on top of Senna Natural Language Processing (NLP) predictions: part-…☆22Updated 3 years ago
- Neural network-based Chinese segmentation system☆18Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Tools and services for evaluating topic models☆15Updated 9 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- NYAN is a news filtering engine written in Python and some Ruby.☆15Updated last year
- Python search module for fast approximate string matching☆54Updated 2 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 7 years ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 7 years ago
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- Sentiment analysis made easy; built on top of solid libraries.☆24Updated 8 years ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.☆11Updated last year
- allennlp + streamlit demo☆22Updated 5 years ago
- CHeSF is the Chrome Headless Scraping Framework, very alpha code for scraping JavaScript-intensive web pages☆20Updated 7 years ago
- A simple model for classifying papers by academic venue (AI/ML/ACL), given a title and abstract. Bare-metal PyTorch port of https://gith…☆12Updated 7 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- Utility to help search within a set of jupyter notebooks☆16Updated 5 years ago
- Extracts one or more knowledge-tree-based tags for a given piece of text.☆8Updated 9 years ago
- logboard: Monitor and Compare Logs on Browser/Terminal.☆21Updated 5 years ago
- SQLFlow client library for Python☆29Updated 2 years ago
- A memory-based, optional-persistence naïve Bayesian text classifier.☆36Updated 10 years ago
- Python Data Processing library☆102Updated last year
- Textprep is an analysis tool for both parallel and non-parallel corpora and their downstream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- ☆13Updated 5 years ago
- Implements some outlier detection algorithms☆11Updated 9 years ago
- An easy-to-use Python wrapper for the Don Best Sports Data API.☆16Updated 2 years ago