semanticize / leven
☆21 Updated 9 years ago
Alternatives and similar repositories for leven
Users who are interested in leven are comparing it to the libraries listed below.
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆249 Updated last week
- Python McParseface: A way to call Parsey McParseface programmatically in Python ☆33 Updated 2 years ago
- Memory-based shallow parser for Python ☆74 Updated 6 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆29 Updated 9 months ago
- clone of https://code.google.com/p/splitta/ so it can be a git submodule ☆34 Updated 12 years ago
- Extract, parse and populate templates from strings ☆27 Updated 6 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ☆81 Updated 9 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing ☆207 Updated 2 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ☆86 Updated 7 years ago
- Goal: make Pattern compatible with Python 3. ☆59 Updated 5 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web ☆198 Updated 7 years ago
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework ☆166 Updated 3 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD] ☆108 Updated 12 years ago
- Semanticizest: dump parser and client ☆20 Updated 9 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 3 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API. ☆66 Updated 3 years ago
- Fast multi-keyword search engine for text strings ☆257 Updated last year
- Experimental parallel data analysis toolkit. ☆122 Updated 3 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- A disk-based key/value store in Python with no dependencies. ☆21 Updated 10 years ago
- ``pynlg`` is a pure Python re-implementation of [SimpleNLG-EnFr](https://github.com/rali-udem/SimpleNLG-EnFr), a Java library enabling bi… ☆29 Updated 2 years ago
- C++ Ternary Search Tree implementation with Python bindings ☆43 Updated 7 years ago
- Import tables from any Wikipedia article as a dataset in Python ☆292 Updated 3 years ago
- 💫 Scripts, tools and resources for developing spaCy ☆126 Updated 6 years ago
- Python stemming library using snowball stemmers ☆264 Updated last month
- Entity Linking for the masses ☆56 Updated 9 years ago
- Data analysis tool. ☆85 Updated 2 years ago
- Python to Gremlin Graph Abstraction Layer ☆55 Updated 8 years ago
- Lightweight, multilingual natural language processing ☆63 Updated 12 years ago
- Web page segmentation and noise removal ☆55 Updated last year