chuanconggao / TopSimLinks
Efficiently search the most similar strings against the query in Python.
☆18Updated 7 months ago
Alternatives and similar repositories for TopSim
Users that are interested in TopSim are comparing it to the libraries listed below
Sorting:
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Python Data Processing library☆102Updated 2 years ago
- Neutral Network based Chinese Segment System☆19Updated 9 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆23Updated 7 years ago
- Python package providing an Inverted Index implementation using dictionaries☆36Updated 4 years ago
- Repository w/ Jupyter + R Notebooks for creating a model to predict the success of Reddit submissions with Keras.☆27Updated 8 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 8 years ago
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines☆13Updated 10 years ago
- The classic movies redux with machine learning using TensorFlow and Keras.☆11Updated 6 years ago
- Levenshtein and Hamming distance computation☆117Updated 6 years ago
- Facebook faiss相关的python接口☆15Updated 5 years ago
- Text pre-processing library for deep learning (Keras, tensorflow).☆117Updated 7 years ago
- A library & tools to evaluate predictive language models.☆64Updated 2 years ago
- A word hashing method based on vectors of letter n-grams. Currently transforms text into sequences of numbers.☆10Updated 7 years ago
- Paragraph Vector Implementation☆56Updated 8 years ago
- A PDFMiner wrapper to ease the text extraction from pdf files.☆25Updated 12 years ago
- Scrapy Eagle is a tool that allow us to run any Scrapy based project in a distributed fashion and monitor how it is going on and how many…☆24Updated 5 years ago
- An easy-to-use Python wrapper for the Don Best Sports Data API.☆16Updated 3 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Updated 2 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 8 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- Distributed text analysis suite based on Celery☆96Updated 3 years ago
- A better working example of SIFRank and SIFRank+ models for keyword extraction. Easy to setup using docker-compose.☆11Updated last year
- Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- Tensorflow implementation of Facebook TagSpace☆74Updated 6 years ago
- Tools to manipulate and extract data from wikipedia dumps☆46Updated 12 years ago
- content discovery... IN 3D☆49Updated 8 years ago
- Natural language processing using unsupervised vectors representation.☆105Updated 5 years ago
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆37Updated 3 years ago