chenkovsky / cyac
High-performance trie and Aho-Corasick automaton (AC automaton) keyword match and replace tool for Python, with a correct case-insensitive implementation.
☆95 · Updated 9 months ago
Alternatives and similar repositories for cyac
Users who are interested in cyac are comparing it to the libraries listed below.
- ☆93 · Updated this week
- Pure Python Aho-Corasick library. ☆216 · Updated 2 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9 ☆59 · Updated 5 years ago
- The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model" ☆121 · Updated 4 years ago
- A Fast ELMo Implementation. (NOT MAINTAINED ANYMORE) ☆38 · Updated 2 years ago
- Modification of the official BERT for downstream tasks ☆31 · Updated 2 years ago
- Inference with state-of-the-art models (pre-trained by LD-Net / AutoNER / VanillaNER / ...) ☆115 · Updated 6 years ago
- Task-oriented dialog system toolkits ☆85 · Updated 2 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling ☆77 · Updated 2 years ago
- seq2seq-based keyphrase generation model sets, including copyrnn, copycnn, and copytransformer ☆50 · Updated 3 years ago
- SegPhrase working on Chinese and Arabic ☆35 · Updated 8 years ago
- A Python implementation of the BM25 ranking function. ☆236 · Updated 5 years ago
- This project is for semantic role labeling using BERT ☆36 · Updated 6 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation ☆53 · Updated 6 years ago
- Latest research advances on semantic slot filling. ☆25 · Updated 2 years ago
- Dataset for CIKM 2018 paper "Multi-Source Pointer Network for Product Title Summarization" ☆73 · Updated 6 years ago
- An HMM-like linear-chain CRF, using the TensorFlow API. ☆36 · Updated 7 years ago
- ICU-based universal language tokenizer ☆32 · Updated 3 years ago
- Exploiting entity linking in queries for entity retrieval ☆81 · Updated 6 years ago
- Reference TensorFlow code for named entity tagging ☆104 · Updated 3 years ago
- ☆38 · Updated 3 years ago
- Chinese Environment Emergency Corpus (中文环境突发事件语料库), Shanghai University, Semantic Intelligence Lab ☆46 · Updated 9 years ago
- Super fast C++ implementation of longest common subsequence/substring ☆69 · Updated last year
- ☆47 · Updated 4 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging ☆35 · Updated 6 years ago
- An extension of word2vec to learn phrase embeddings ☆75 · Updated 6 years ago
- ☆41 · Updated 7 years ago
- ☆57 · Updated 7 years ago
- NoiseMix: data generation for natural language ☆40 · Updated 7 years ago
- ☆31 · Updated 8 years ago