rec / myers

A tiny, generic implementation of the Myers diff algorithm

☆20

Alternatives and similar repositories for myers:

Users that are interested in myers are comparing it to the libraries listed below

chenkovsky / cyac
High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…
☆94Updated 6 months ago
Meteorix / pylcs
super fast cpp implementation of longest common subsequence/substring
☆67Updated last year
doccano / doccano-client
A simple client for doccano API.
☆85Updated 11 months ago
abusix / ahocorapy
Pure python Aho-Corasick library.
☆214Updated 2 years ago
messense / fasttext-serving
fastText model serving service
☆59Updated 5 months ago
akivajp / pycedar
Python binding of cedar (implementation of efficiently-updatable double-array trie) using Cython
☆17Updated 5 years ago
zejunwang1 / fastMatch
Large-scale exact string matching tool
☆17Updated last month
NightTsarina / python-rocksdb
Python bindings for RocksDB
☆34Updated 2 years ago
aosingh / lexpy
Python package for lexicon; Trie and DAWG implementation.
☆55Updated 4 months ago
steven-s / text-shingles
k-shingling for text to help compare similarity
☆19Updated 5 years ago
bung87 / cppjieba-py
python bindings of cppjieba ,recommand jieba_fast for results consistency and speed balance
☆21Updated 5 years ago
nusnlp / m2scorer
MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.
☆151Updated 2 years ago
sheng-kai-wang / DST4LLM
DST(Dialogue State Tracker) for LLM(Large Language Model)
☆23Updated last year
G-Research / ahocorasick_rs
Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python
☆181Updated this week
OpenNMT / Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
☆302Updated last week
aboSamoor / pycld2
☆169Updated 3 weeks ago
MichaelAquilina / hashedindex
Python package providing an Inverted Index implementation using dictionaries
☆35Updated 3 years ago
touhi99 / N-gram-Language-model
Programming for NLP Project - Implement a basic n-gram language model and generate sentence using beam search
☆12Updated 5 years ago
huridocs / pdf-reading-order
☆13Updated last year
virtualsociety / ai-table-recognition
☆38Updated 4 years ago
davidberenstein1957 / fast-sentence-transformers
Simply, faster, sentence-transformers
☆141Updated 7 months ago
DinLei / DoubleArrayTrie
双端trie树的python实现
☆11Updated 6 years ago
4AI / langml
A Keras-based and TensorFlow-backend NLP Models Toolkit.
☆11Updated 2 years ago
nnnet / superminhash
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex
☆19Updated 2 years ago
Jason3900 / gector-fast
A faster, simpler and distributed implementation of GECToR, a seq2edit GEC model
☆15Updated 2 years ago
mammothb / editdistpy
Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…
☆23Updated 7 months ago
FerdinandZhong / punctuator
A small seq2seq punctuator tool based on DistilBERT
☆51Updated 4 months ago
messense / fasttext-wheel
Build and upload fastText Python wheels to PyPI
☆23Updated last year
grantjenks / python-wordsegment
English word segmentation, written in pure-Python, and based on a trillion-word corpus.
☆374Updated 2 years ago
google-research-datasets / clang8
cLang-8 is a dataset for grammatical error correction.
☆104Updated 2 years ago