chuanconggao / TopSim
Efficiently search for the strings most similar to a query in Python.
☆18 Updated 6 months ago
Alternatives and similar repositories for TopSim
Users who are interested in TopSim are comparing it to the libraries listed below.
- Python search module for fast approximate string matching ☆54 Updated 2 years ago
- Python Data Processing library ☆102 Updated last year
- Neural Network based Chinese Segment System ☆18 Updated 9 years ago
- Python package providing an Inverted Index implementation using dictionaries ☆35 Updated 4 years ago
- Practical Natural Language Processing Tools for Humans is built on top of Senna Natural Language Processing (NLP) predictions: part-… ☆22 Updated 4 years ago
- Aho-Corasick string replacement utility ☆25 Updated 6 years ago
- Tools to manipulate and extract data from wikipedia dumps ☆46 Updated 12 years ago
- Distributed text analysis suite based on Celery ☆96 Updated 2 years ago
- An efficient simhash implementation for python ☆126 Updated 6 years ago
- Python interfaces for Facebook faiss ☆15 Updated 5 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages. ☆63 Updated 5 years ago
- Knowledge extraction from web data ☆92 Updated 7 years ago
- An easy-to-use Python wrapper for the Don Best Sports Data API. ☆16 Updated 2 years ago
- A PDFMiner wrapper to ease the text extraction from pdf files. ☆25 Updated 12 years ago
- Natural language generation language ☆55 Updated 6 years ago
- Repository w/ Jupyter + R Notebooks for creating a model to predict the success of Reddit submissions with Keras. ☆28 Updated 8 years ago
- Similarity search engine built around Faiss library ☆78 Updated 2 years ago
- Information Retrieval Library (in Python) ☆83 Updated 3 years ago
- Paragraph Vector Implementation ☆56 Updated 8 years ago
- Fast multi-keyword search engine for text strings ☆257 Updated last year
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines ☆13 Updated 10 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python ☆181 Updated last year
- Find strings/words in text; convenience and C speed ☆127 Updated 3 years ago
- A pure Python implementation of Aho-Corasick algorithm. ☆23 Updated 7 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important. ☆54 Updated 10 years ago
- Handle many API calls from a single HTTP request ☆55 Updated 7 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering. ☆14 Updated 8 years ago
- ☆24 Updated 7 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research. ☆31 Updated 4 years ago
- A disk-based key/value store in Python with no dependencies. ☆21 Updated 10 years ago