Meteorix / pylcsLinks
super fast cpp implementation of longest common subsequence/substring
☆68Updated last year
Alternatives and similar repositories for pylcs
Users that are interested in pylcs are comparing it to the libraries listed below
Sorting:
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated 7 months ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆76Updated 2 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆75Updated 2 years ago
- Linear chain conditional random fields are implemented using Numpy and Mxnet/Gluon, and batch training is supported, not limited to train…☆23Updated 6 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 5 years ago
- 分享一些S2S在实际应 用中遇到的问题和解决方法。☆27Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆151Updated 4 years ago
- Knowledge Distillation from BERT☆52Updated 6 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Updated 2 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆54Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆26Updated 5 years ago
- NoiseMix - data generation for natural language☆40Updated 7 years ago
- An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0☆136Updated 4 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆46Updated 4 years ago
- ☆46Updated 3 years ago
- Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions intoTrainable Recurrent Neural Network…☆115Updated 3 years ago
- ☆52Updated 4 years ago
- 香侬科技(北京香侬慧语科技有限责任公司)知乎爆料备份☆41Updated 5 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆117Updated last year
- Pretrain CPM-1☆51Updated 4 years ago
- A text augmentation tool for named entity recognition.☆52Updated 3 years ago
- ROUGE for multilingual Summarization☆24Updated 3 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Updated 3 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆47Updated 4 years ago
- ☆33Updated 5 years ago
- modification of official bert for downstream task☆31Updated 2 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆13Updated 4 years ago
- ☆59Updated 5 years ago