krzysztoffiok / TextGuide
A text truncation method, useful for instance in long text classification
☆23Updated 2 years ago
Alternatives and similar repositories for TextGuide:
Users that are interested in TextGuide are comparing it to the libraries listed below
- ☆58Updated 2 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆80Updated 2 years ago
- ☆41Updated 3 years ago
- ☆86Updated 3 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆26Updated 2 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆50Updated 3 years ago
- A collection of topic diversity measures for topic modeling☆45Updated 3 years ago
- ☆59Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 3 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- https://arxiv.org/pdf/1909.04054☆78Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 11 months ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆50Updated 4 years ago
- Document Classification on COVID-19 Literature using the LitCovid collection and the Hedwig library.☆16Updated 5 months ago
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- AttentionRank: Unsupervised keyphrase Extraction using Self and Cross Attentions☆26Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆104Updated last year
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆80Updated 2 years ago
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).☆66Updated last year
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"☆31Updated 4 years ago
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆87Updated last year
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆33Updated 3 years ago
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification"☆43Updated 4 years ago
- A python script to break a sentence into clauses.☆34Updated 3 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆56Updated 2 years ago
- Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction☆67Updated 2 years ago
- Graph parsing approach to structured sentiment analysis.☆41Updated 2 years ago