Meteorix / pylcs
super fast cpp implementation of longest common subsequence/substring
☆66Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pylcs
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated 3 weeks ago
- ☆52Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆73Updated 2 years ago
- 分享一些S2S在实际应用中遇到的问题和解决方法。☆27Updated 4 years ago
- ☆22Updated 3 years ago
- 非常好用的工具包,可以直接安装并使用☆20Updated 2 years ago
- modification of official bert for downstream task☆31Updated last year
- Python下shuffle几百G文件☆33Updated 3 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆21Updated 2 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆76Updated last year
- 大规模中文语料☆38Updated 5 years ago
- 无监督文本生成的一些方法☆49Updated 3 years ago
- 香侬科技(北京香侬慧语科技有限责任公司)知乎爆料备份☆41Updated 4 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- A Translation Task using TurboTransformers☆11Updated 3 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- EasyTransfer is designed to make the development of transfer learning in NLP applications easier.☆8Updated 4 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Updated 4 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆51Updated 2 years ago
- kenlm语言模型,并提供python的rest服务☆29Updated 6 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆36Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆58Updated 2 years ago
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions intoTrainable Recurrent Neural Network…☆112Updated 3 years ago
- ☆15Updated 4 years ago
- ICU based universal language tokenizer☆29Updated 2 years ago
- Finetune CPM-1☆24Updated 3 years ago
- List some datasets in NLP field.☆29Updated 3 years ago