chenkovsky / cyac
High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementation!
☆94Updated 6 months ago
Alternatives and similar repositories for cyac:
Users that are interested in cyac are comparing it to the libraries listed below
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- ☆93Updated 5 months ago
- A Fast ELMo Implementation. (NOT MAINTAIN ANYMORE)☆38Updated 2 years ago
- Pure python Aho-Corasick library.☆215Updated 2 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆77Updated 2 years ago
- The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"☆121Updated 4 years ago
- modification of official bert for downstream task☆31Updated 2 years ago
- SegPhrase working on Chinese and Arabic☆35Updated 8 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆53Updated 6 years ago
- Python version of the evaluation script from CoNLL'00-☆94Updated 4 years ago
- Task-oriented dialog system toolkits☆85Updated 2 years ago
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆32Updated 4 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)☆77Updated 6 years ago
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆320Updated 4 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning☆52Updated 6 years ago
- Evaluation script for named entity recognition (NER) systems based on entity-level F1 score.☆70Updated 4 years ago
- ICU based universal language tokenizer☆31Updated 3 years ago
- Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling☆66Updated 5 years ago
- Automated Phrase Mining from Massive Text Corpora in Python.☆171Updated 3 years ago
- Labeled Span Graph Networks☆118Updated 6 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…☆44Updated 4 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 11 months ago
- SyntaxSQLNet: Syntax Tree Networks for Complex and Cross Domain Text-to-SQL Task☆133Updated 3 years ago
- datasets of natural language understanding and dialogue state tracking☆145Updated 4 years ago
- Python extension module for accelerating regular expressions using libesm☆131Updated last year
- A grammatical error correction reading list maintained by Beijing Language and Culture University Natural Language Processing Group☆24Updated 4 years ago
- Latest research advances on semantic slot filling.☆25Updated 2 years ago
- This is a CoNLL formatted version of the OntoNotes 5.0 release.☆189Updated 10 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆177Updated 9 months ago