This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".
☆64Aug 13, 2020Updated 5 years ago
Alternatives and similar repositories for bertram
Users that are interested in bertram are comparing it to the libraries listed below.
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Jul 9, 2020Updated 5 years ago
- ☆12Mar 24, 2021Updated 5 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- ☆19Apr 7, 2020Updated 6 years ago
- ☆14Nov 22, 2024Updated last year
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- A retrieve and edit approach to generate sarcasm by reversing valence and adding incongruent common sense context☆32Mar 27, 2021Updated 5 years ago
- Ranking of Top Institutes for Natural Language Processing (NLP)☆23Apr 8, 2020Updated 6 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Sep 17, 2022Updated 3 years ago
- Data and preprocessing scripts for SemEval 2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding☆15Feb 3, 2022Updated 4 years ago
- ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation☆98Jul 6, 2023Updated 2 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- jiant-dev☆28Dec 17, 2020Updated 5 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…☆14Feb 2, 2020Updated 6 years ago
- TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)☆171Jun 15, 2022Updated 3 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- Massively Multilingual Transfer for NER☆86Oct 7, 2021Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆194Sep 22, 2025Updated 6 months ago
- ☆12Mar 31, 2026Updated last week
- EMNLP'2020: Look at the First Sentence: Position Bias in Question Answering☆29Nov 4, 2020Updated 5 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Apr 30, 2024Updated last year
- Simple Structured Perceptron tagger in Python☆10May 30, 2017Updated 8 years ago
- SDK for TEASPN, a framework and a protocol for integrated writing assistance environments☆60Dec 9, 2022Updated 3 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Jun 15, 2020Updated 5 years ago
- A phenomenon-wise evaluation dataset for Japanese-English machine translation robustness. The dataset is based on the MTNT dataset, with …☆19Feb 18, 2021Updated 5 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics. Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Mar 16, 2022Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Question generation from text☆15Sep 19, 2012Updated 13 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- Temporal Word Analogies in Python☆18Aug 2, 2017Updated 8 years ago
- ☆21Oct 13, 2021Updated 4 years ago
- ☆93Feb 13, 2024Updated 2 years ago
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- [NeurIPS 2021] Open Rule Induction☆20May 22, 2022Updated 3 years ago