This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".
☆64Aug 13, 2020Updated 5 years ago
Alternatives and similar repositories for bertram
Users that are interested in bertram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Jul 9, 2020Updated 5 years ago
- ☆12Mar 24, 2021Updated 4 years ago
- ☆19Apr 7, 2020Updated 5 years ago
- ☆14Nov 22, 2024Updated last year
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- AMR-to-text Generation with Graph Transformer☆18Nov 16, 2020Updated 5 years ago
- A retrieve and edit approach to generate sarcasm by reversing valence and adding incongruent common sense context☆32Mar 27, 2021Updated 4 years ago
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.☆16Oct 14, 2020Updated 5 years ago
- Ranking of Top Institutes for Natural Language Processing (NLP)☆23Apr 8, 2020Updated 5 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Sep 17, 2022Updated 3 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- ☆10May 5, 2017Updated 8 years ago
- User-friendly extensions to MeSH☆11Feb 4, 2016Updated 10 years ago
- Data and preprocessing scripts for SemEval 2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding☆14Feb 3, 2022Updated 4 years ago
- ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation☆98Jul 6, 2023Updated 2 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- jiant-dev☆28Dec 17, 2020Updated 5 years ago
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…☆14Feb 2, 2020Updated 6 years ago
- TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)☆171Jun 15, 2022Updated 3 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- Massively Multilingual Transfer for NER☆86Oct 7, 2021Updated 4 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆194Sep 22, 2025Updated 6 months ago
- ☆12Feb 16, 2026Updated last month
- EMNLP'2020: Look at the First Sentence: Position Bias in Question Answering☆29Nov 4, 2020Updated 5 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Apr 30, 2024Updated last year
- Simple Structured Perceptron tagger in Python☆10May 30, 2017Updated 8 years ago
- ☆62Apr 19, 2022Updated 3 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Jun 15, 2020Updated 5 years ago
- A phenomenon-wise evaluation dataset for Japanese-English machine translation robustness. The dataset is based on the MTNT dataset, with …☆19Feb 18, 2021Updated 5 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Mar 16, 2022Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Question generation from text☆15Sep 19, 2012Updated 13 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Apr 16, 2021Updated 4 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- Temporal Word Analogies in Python☆18Aug 2, 2017Updated 8 years ago