dongfang91 / text_similarity
Text similarity using BERT sentence embeddings
☆20Updated 4 years ago
Alternatives and similar repositories for text_similarity:
Users that are interested in text_similarity are comparing it to the libraries listed below
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 3 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆31Updated 3 years ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news☆34Updated 4 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Multitask Learning with Pretrained Transformers☆39Updated 3 years ago
- Schema2QA Question Answering Dataset☆18Updated 2 years ago
- BERT, LDA, and TFIDF based keyword extraction in Python☆71Updated 11 months ago
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Updated 3 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆13Updated last year
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated last month
- ☆12Updated last year
- ☆19Updated 2 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆79Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated 2 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆34Updated 4 years ago
- ☆65Updated 2 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14Updated 4 years ago
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification"☆43Updated 4 years ago
- ☆69Updated 4 years ago
- A text augmentation tool for named entity recognition.☆52Updated 3 years ago
- Using Huggingface to generate relation expressions☆15Updated 4 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations☆26Updated 2 years ago
- Document Classification on COVID-19 Literature using the LitCovid collection and the Hedwig library.☆16Updated 3 months ago
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Updated 6 months ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆54Updated 3 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago