chuanconggao / TopSimLinks
Efficiently search the most similar strings against the query in Python.
☆18Updated 8 months ago
Alternatives and similar repositories for TopSim
Users that are interested in TopSim are comparing it to the libraries listed below
Sorting:
- Python search module for fast approximate string matching☆54Updated 3 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆23Updated 7 years ago
- Python package providing an Inverted Index implementation using dictionaries☆36Updated 4 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆64Updated 5 years ago
- Python Data Processing library☆102Updated 2 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 8 years ago
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmi…☆24Updated last year
- An easy-install script for LibShortText☆27Updated 11 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 8 years ago
- Neutral Network based Chinese Segment System☆19Updated 9 years ago
- Facebook faiss相关的python接口☆15Updated 5 years ago
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines☆13Updated 10 years ago
- Similarity search engine built around Faiss library☆78Updated 3 years ago
- An efficient simhash implementation for python☆127Updated 6 years ago
- Chinese word segmentation algorithm based on entropy(基于熵,无需语料库的中文分词)☆11Updated 7 years ago
- Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-…☆22Updated 4 years ago
- Caver: a toolkit for multilabel text classification.☆39Updated 6 years ago
- Find which links on a web page are pagination links☆29Updated 9 years ago
- An easy-to-use Python wrapper for the Don Best Sports Data API.☆16Updated 3 years ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 8 years ago
- Paragraph Vector Implementation☆56Updated 8 years ago
- Distributed text analysis suite based on Celery☆96Updated 3 years ago
- Web Full Stack Practice for Beginners:Docker + uWSGI + Celery + Django + Supervisor + React + Nginx + HTTPS + Postgres + Redis☆38Updated 3 years ago
- A PDFMiner wrapper to ease the text extraction from pdf files.☆25Updated 12 years ago
- Find strings/words in text; convenience and C speed☆126Updated 3 years ago
- Different approaches to computing document similarity☆28Updated 9 years ago
- ☆56Updated 10 years ago
- ☆15Updated 7 years ago
- unofficial git mirror of http://svn.whoosh.ca svn repo☆49Updated 16 years ago
- Aho-Corasick string replacement utility☆26Updated 6 years ago