Meteorix / pylcsLinks
super fast cpp implementation of longest common subsequence/substring
☆72Updated 2 years ago
Alternatives and similar repositories for pylcs
Users that are interested in pylcs are comparing it to the libraries listed below
Sorting:
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated last year
- ☆52Updated 4 years ago
- An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0☆137Updated 5 years ago
- Pure python Aho-Corasick library.☆220Updated last week
- ICU based universal language tokenizer☆33Updated 4 years ago
- The fast python bm25 algorithm implemented with reverted index☆49Updated 3 years ago
- modification of official bert for downstream task☆32Updated 2 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆35Updated 6 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆78Updated 2 years ago
- terashuf shuffles multi-terabyte text files using limited memory☆228Updated 2 years ago
- Implementation of pQRNN in PyTorch☆46Updated 4 years ago
- 无监督文本生成的一些方法☆49Updated 4 years ago
- ☆45Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Updated 5 years ago
- Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"☆110Updated 5 years ago
- Python下shuffle几百G文件☆33Updated 4 years ago
- a simple yet complete implementation of the popular BERT model☆128Updated 5 years ago
- JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.☆54Updated last year
- Python Set subclass that supports searching by ngram similarity☆119Updated 4 years ago
- Position embedding layers in Keras☆58Updated 4 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Updated 5 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Updated 3 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆54Updated 6 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆60Updated 5 years ago
- Finetune CPM-1☆24Updated 4 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆60Updated 5 years ago
- ROUGE for multilingual Summarization☆25Updated 4 years ago
- Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions intoTrainable Recurrent Neural Network…☆115Updated 4 years ago