Meteorix / pylcs
Super-fast C++ implementation of longest common subsequence/substring
☆67Updated last year
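The two problems named in the description differ in one respect: a common subsequence may skip characters, while a common substring must be contiguous. The following is a minimal pure-Python sketch of the textbook dynamic-programming approach to each, included only to illustrate that difference. It is not pylcs's API or its C++ implementation, and the function names `lcs_subsequence_length` and `lcs_substring_length` are hypothetical.

```python
def lcs_subsequence_length(a: str, b: str) -> int:
    """Length of the longest common subsequence (matches may be non-adjacent)."""
    # prev[j] holds the answer for the already-processed prefix of a and b[:j].
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                # Extend the best subsequence that ends before both characters.
                curr.append(prev[j - 1] + 1)
            else:
                # Drop one character from either string and keep the better result.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_substring_length(a: str, b: str) -> int:
    """Length of the longest common substring (matches must be contiguous)."""
    best = 0
    # prev[j] holds the length of the common run ending at the previous
    # character of a and at b[j - 1].
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            # A mismatch resets the run, which is what enforces contiguity.
            run = prev[j - 1] + 1 if ch_a == ch_b else 0
            curr.append(run)
            best = max(best, run)
        prev = curr
    return best
```

Both functions take O(len(a) * len(b)) time and keep only two rows of the table, so memory stays proportional to the length of `b`.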
Alternatives and similar repositories for pylcs:
Users who are interested in pylcs are comparing it to the libraries listed below.
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated 5 months ago
- ☆52Updated 3 years ago
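- High-performance small model evaluation. Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- Some methods for unsupervised text generation☆48Updated 3 years ago
- Analytical solutions for logistic regression and single-layer softmax☆12Updated 3 years ago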
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆75Updated 2 years ago
- EasyTransfer is designed to make the development of transfer learning in NLP applications easier.☆8Updated 4 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆77Updated 2 years ago
- Sharing some problems encountered when applying S2S in practice, and their solutions.☆27Updated 4 years ago
- Shuffling files of several hundred GB in Python☆33Updated 3 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- List some datasets in NLP field.☆29Updated 3 years ago
- code and data for paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"☆24Updated 3 years ago
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- PyTorch version of the SimCSE unsupervised semantic similarity model☆22Updated 3 years ago
- modification of official bert for downstream task☆31Updated 2 years ago
- Large-scale Chinese corpus☆40Updated 5 years ago
- Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions into Trainable Recurrent Neural Network…☆114Updated 3 years ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- ☆47Updated 4 years ago
- ☆17Updated 4 years ago
- ☆23Updated 4 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated 2 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆52Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆120Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Updated last year
- A pre-trained model with multi-exit transformer architecture.☆55Updated 2 years ago
- Task-oriented dialog system toolkits☆85Updated 2 years ago
- Code and Data for SIGIR 2020 Paper "Few-Shot Generative Conversational Query Rewriting"☆65Updated last year