chuanconggao / TopSimLinks
Efficiently search the most similar strings against the query in Python.
☆18Updated 4 months ago
Alternatives and similar repositories for TopSim
Users that are interested in TopSim are comparing it to the libraries listed below
Sorting:
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 8 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- Python package providing an Inverted Index implementation using dictionaries☆35Updated 4 years ago
- Neutral Network based Chinese Segment System☆18Updated 8 years ago
- Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-…☆22Updated 4 years ago
- Deep neural parser for database query☆18Updated 2 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 8 years ago
- A better working example of SIFRank and SIFRank+ models for keyword extraction. Easy to setup using docker-compose.☆11Updated 11 months ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 8 years ago
- Similarity search engine built around Faiss library☆78Updated 2 years ago
- Python Data Processing library☆102Updated last year
- Natural language generation language☆55Updated 6 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Facebook faiss相关的python接口☆15Updated 5 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆23Updated 7 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- Knowledge extraction from web data☆92Updated 7 years ago
- A library & tools to evaluate predictive language models.☆63Updated 2 years ago
- An efficient simhash implementation for python☆126Updated 5 years ago
- Aho-Corasick string replacement utility☆25Updated 5 years ago
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmi…☆24Updated 8 months ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python☆181Updated last year
- Experiment with document similarity via Matt Kusner's MWD paper☆24Updated 9 years ago
- Code for "All-In-1: Short Text Classification with One Model for All Languages" - Plank (2017), IJCNLP 2017 shared task 4☆16Updated 7 years ago
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.☆52Updated 8 years ago
- logboard: Monitor and Compare Logs on Browser/Terminal.☆21Updated 6 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago