chenkovsky / cyac
High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementation!
☆94Updated last month
Related projects ⓘ
Alternatives and complementary repositories for cyac
- ☆91Updated last week
- modification of official bert for downstream task☆31Updated last year
- Automatically extracting keyphrases that are salient to the document meanings is an essential step to semantic document understanding. An…☆155Updated last year
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.☆149Updated 2 years ago
- SegPhrase working on Chinese and Arabic☆33Updated 7 years ago
- A Fast ELMo Implementation. (NOT MAINTAIN ANYMORE)☆38Updated last year
- Pure python Aho-Corasick library.☆212Updated last year
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆33Updated 4 years ago
- Task-oriented dialog system toolkits☆85Updated last year
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆67Updated 4 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆76Updated last year
- This is a CoNLL formatted version of the OntoNotes 5.0 release.☆190Updated 9 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning☆51Updated 5 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆54Updated 5 years ago
- A HMM-like linear-chain CRF, used Tensorflow API.☆37Updated 6 years ago
- ☆40Updated 2 years ago
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆313Updated 3 years ago
- Baseline for the CNLI corpus☆57Updated 5 years ago
- ICU based universal language tokenizer☆30Updated 2 years ago
- Automated Phrase Mining from Massive Text Corpora in Python.☆168Updated 3 years ago
- A grammatical error correction reading list maintained by Beijing Language and Culture University Natural Language Processing Group☆25Updated 3 years ago
- Python version of the evaluation script from CoNLL'00-☆91Updated 3 years ago
- A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries☆12Updated 4 years ago
- This is the repository for NLPCC2020 task AutoIE☆52Updated 4 years ago
- seq2seq based keyphrase generation model sets, including copyrnn copycnn and copytransfomer☆51Updated 2 years ago
- ☆48Updated 3 years ago
- Python scripts preprocessing Penn Treebank and Chinese Treebank☆162Updated 4 years ago
- Multi-stage passage ranking: monoBERT + duoBERT☆112Updated 3 years ago