rec / myersLinks
A tiny, generic implementation of the Myers diff algorithm
☆22Updated 5 years ago
Alternatives and similar repositories for myers
Users that are interested in myers are comparing it to the libraries listed below
Sorting:
- Pure python Aho-Corasick library.☆220Updated 3 weeks ago
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated last year
- python bindings of cppjieba ,recommand jieba_fast for results consistency and speed balance☆22Updated 6 years ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆329Updated last month
- A clone of Darts (Double-ARray Trie System)☆159Updated 8 months ago
- 80x faster and 95% accurate language identification with Fasttext☆164Updated 2 years ago
- Fast, efficiently stored Trie for Python. Uses libdatrie.☆547Updated last month
- super fast cpp implementation of longest common subsequence/substring☆72Updated 2 years ago
- Simhash and near-duplicate detection☆423Updated 2 years ago
- wrap cppjieba by swig.☆20Updated 7 years ago
- Python package providing an Inverted Index implementation using dictionaries☆36Updated 4 years ago
- Python package for lexicon; Trie and DAWG implementation.☆56Updated last year
- A small seq2seq punctuator tool based on DistilBERT☆53Updated last year
- 大规模中文语料☆44Updated 6 years ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 3 years ago
- Python cffi binding to CppJieba☆15Updated 5 years ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆218Updated last week
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆1,081Updated last month
- Python bindings for RocksDB☆35Updated 3 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆381Updated 2 weeks ago
- Pure-Python full-text search library☆653Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆123Updated 3 months ago
- BERT Tokenizer in C++☆79Updated 5 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 4 years ago
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux☆68Updated 4 months ago
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆69Updated 6 months ago
- Constants used in Chinese text processing☆386Updated last year
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆25Updated 7 years ago
- 双端trie树的python实现☆11Updated 7 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago