Meteorix / pylcs
Super-fast C++ implementation of longest common subsequence/substring
☆72Updated 2 years ago
Alternatives and similar repositories for pylcs
Users that are interested in pylcs are comparing it to the libraries listed below
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆95Updated last year
- ICU based universal language tokenizer☆33Updated 3 years ago
- ☆52Updated 4 years ago
- An Implementation of CRF (Conditional Random Fields) in PyTorch 1.0☆137Updated 5 years ago
- A fast Python BM25 algorithm implemented with an inverted index☆48Updated 3 years ago
- terashuf shuffles multi-terabyte text files using limited memory☆226Updated 2 years ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Updated last month
- Implementation of pQRNN in PyTorch☆46Updated 4 years ago
- Pure python Aho-Corasick library.☆220Updated 2 years ago
- JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.☆53Updated last year
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Updated 5 years ago
- ☆45Updated 4 years ago
- Python APTED algorithm for the Tree Edit Distance☆98Updated 7 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Updated 5 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Updated 2 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆78Updated 2 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆163Updated 2 years ago
- ☆13Updated 5 years ago
- ☆16Updated 5 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models☆35Updated 5 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆35Updated 6 years ago
- A Translation Task using TurboTransformers☆11Updated 4 years ago
- The unified platform for data-related resources.☆134Updated 2 years ago
- A pre-trained model with multi-exit transformer architecture.☆56Updated 2 years ago
- List some datasets in NLP field.☆29Updated 4 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"☆18Updated 4 years ago
- Shuffle files of several hundred GB in Python☆33Updated 4 years ago