ryszard / python-ngrams
N-grams approximate string matching implementation in pure Python
☆26Updated 14 years ago
Alternatives and similar repositories for python-ngrams:
Users that are interested in python-ngrams are comparing it to the libraries listed below
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- Pure-python reader for DAWGs created by dawgdic C++ library or DAWG Python extension.☆48Updated last year
- A platform for storing large semantic networks on MongoDB☆22Updated 13 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 7 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service☆42Updated 13 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 2 weeks ago
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- iCQA - Intelligent Community Question Answering Framework☆31Updated 8 years ago
- Parser for KAF NAF files written in Python☆16Updated 3 years ago
- Entity Linking for the masses☆56Updated 9 years ago
- Python wrapper for Apache OpenNLP tools☆34Updated 8 years ago
- Contains the main implementation of programs for the paper: Reproducing and learning new algebraic operations on word embeddings using ge…☆12Updated 8 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Convert URL's to a normalized unicode format☆67Updated 7 years ago
- stav text annotation visualiser☆34Updated 13 years ago
- C library for efficient string matching with Aho-Corasick☆21Updated 13 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from …☆26Updated 10 years ago
- A Python module for extracting relevant tags from text documents.☆16Updated 13 years ago
- Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-…☆22Updated 3 years ago