Meteorix / pylcsLinks
super fast cpp implementation of longest common subsequence/substring
☆71Updated last year
Alternatives and similar repositories for pylcs
Users that are interested in pylcs are comparing it to the libraries listed below
Sorting:
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆95Updated 11 months ago
- ☆52Updated 4 years ago
- ICU based universal language tokenizer☆33Updated 3 years ago
- An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0☆137Updated 5 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- Python下shuffle几百G文件☆33Updated 4 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆78Updated 2 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- terashuf shuffles multi-terabyte text files using limited memory☆226Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- ☆46Updated 3 years ago
- Finetune CPM-1☆24Updated 4 years ago
- A pre-trained model with multi-exit transformer architecture.☆55Updated 2 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions intoTrainable Recurrent Neural Network…☆115Updated 4 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Updated 5 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆35Updated 6 years ago
- 非常好用的工具包,可以直接安装并使用☆21Updated 3 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Updated 5 years ago
- A text augmentation tool for named entity recognition.☆54Updated 4 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆131Updated 3 years ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆292Updated 4 years ago
- The fast python bm25 algorithm implemented with reverted index☆48Updated 3 years ago
- ROUGE for multilingual Summarization☆25Updated 3 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆60Updated 5 years ago
- A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 ) and related Language Models in production environments.☆96Updated 4 years ago
- Task-oriented dialog system toolkits☆86Updated 2 years ago
- 无监督文本生成的一些方法☆49Updated 4 years ago