matwerner / fast-wmd
☆16 · Updated last year
Related projects:
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging ☆35 · Updated 4 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis ☆14 · Updated 4 years ago
- A python tool for building large scale Wikipedia-based Information Retrieval datasets ☆44 · Updated 3 years ago
- ☆23 · Updated 4 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization. ☆43 · Updated 4 months ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆62 · Updated 4 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model. ☆35 · Updated 2 years ago
- BERT models for many languages created from Wikipedia texts ☆34 · Updated 4 years ago
- ☆36 · Updated 2 years ago
- Accompanying repository of our AAAI-20 paper "Fine-Grained Argument Unit Recognition and Classification." ☆20 · Updated 4 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset" ☆51 · Updated 2 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between… ☆32 · Updated last year
- ☆55 · Updated 3 weeks ago
- ☆29 · Updated 2 years ago
- ☆73 · Updated 3 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 · Updated 4 years ago
- This repository hosts the code for a tokenizer of tweets. ☆12 · Updated 5 years ago
- ☆41 · Updated 4 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking". ☆50 · Updated 2 years ago
- An embeddable annotation tool for end-to-end cross-document coreference ☆41 · Updated last year
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆91 · Updated last year
- StAtutory Reasoning Assessment ☆11 · Updated last year
- ☆34 · Updated 3 years ago
- Dynamic ensemble decoding with transformer-based models ☆29 · Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆80 · Updated 3 weeks ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 7 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆63 · Updated last year
- ☆17 · Updated last year
- Automatically detect errors in annotated corpora. ☆45 · Updated last year
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 · Updated last year