Meteorix / pylcs
super fast cpp implementation of longest common subsequence/substring
☆66Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pylcs
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated last month
- ICU based universal language tokenizer☆30Updated 2 years ago
- 大规模中文语料☆38Updated 5 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆74Updated 2 years ago
- 非常好用的工具包,可以直接安装并使用☆20Updated 2 years ago
- 分享一些S2S在实际应用中遇到的问题和解决方法。☆27Updated 4 years ago
- modification of official bert for downstream task☆31Updated last year
- Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions intoTrainable Recurrent Neural Network…☆112Updated 3 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆117Updated 3 years ago
- ☆52Updated 3 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆13Updated 4 years ago
- Linear chain conditional random fields are implemented using Numpy and Mxnet/Gluon, and batch training is supported, not limited to train…☆24Updated 5 years ago
- An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0☆136Updated 4 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Updated last year
- Python下shuffle几百G文件☆33Updated 3 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆76Updated last year
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated last year
- Semi-supervised Learning for Sentiment Analysis☆53Updated 4 years ago
- ☆46Updated 3 years ago
- ☆11Updated 4 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆69Updated 3 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆106Updated 3 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- pytorch版simcse无监督语义相似模型☆22Updated 3 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆133Updated 5 months ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆124Updated 2 years ago
- ☆11Updated 2 years ago
- kenlm语言模型,并提供python的rest服务☆29Updated 6 years ago