Meteorix / pylcsLinks
super fast cpp implementation of longest common subsequence/substring
☆72Updated last year
Alternatives and similar repositories for pylcs
Users that are interested in pylcs are comparing it to the libraries listed below
Sorting:
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆95Updated 11 months ago
- ☆52Updated 4 years ago
- An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0☆137Updated 5 years ago
- terashuf shuffles multi-terabyte text files using limited memory☆226Updated 2 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Updated 5 years ago
- The fast python bm25 algorithm implemented with reverted index☆48Updated 3 years ago
- Pure python Aho-Corasick library.☆219Updated 2 years ago
- Implementation of pQRNN in PyTorch☆46Updated 4 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated 2 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆35Updated 6 years ago
- modification of official bert for downstream task☆32Updated 2 years ago
- Python下shuffle几百G文件☆33Updated 4 years ago
- a simple yet complete implementation of the popular BERT model☆128Updated 5 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆78Updated 2 years ago
- JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.☆53Updated last year
- Self-contained Python package for OpenFst☆51Updated 2 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 4 years ago
- List some datasets in NLP field.☆29Updated 4 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆24Updated 4 months ago
- Finetune CPM-1☆24Updated 4 years ago
- ☆46Updated 4 years ago
- ICU based universal language tokenizer☆33Updated 3 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- Source code to reproduce results of our paper "DIET: Lightweight Language Understanding for Dialogue Systems"☆63Updated 5 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- 无监督文本生成的一些方法☆49Updated 4 years ago
- 非常好用的工具包,可以直接安装并使用☆21Updated 3 years ago
- BERT for joint intent classification and slot filling☆39Updated 6 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆117Updated 4 months ago