john-hewitt / embed-initLinks
Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs
☆18Updated 3 years ago
Alternatives and similar repositories for embed-init
Users that are interested in embed-init are comparing it to the libraries listed below
Sorting:
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆38Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated 10 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Updated 2 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆23Updated last year
- ☆14Updated 8 months ago
- Unifew: Unified Fewshot Learning Model☆18Updated 3 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Updated 2 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆19Updated 2 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆18Updated 2 years ago
- Learning to Model Editing Processes☆26Updated 3 years ago
- ☆29Updated 3 years ago
- Pile Deduplication Code☆19Updated 2 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆58Updated 2 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆50Updated 3 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆18Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Efficient Memory-Augmented Transformers☆34Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆29Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Updated 2 years ago
- ☆25Updated 2 years ago
- Repository for Skill Set Optimization☆13Updated 11 months ago
- Staged Training for Transformer Language Models☆32Updated 3 years ago
- ☆11Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated last month
- ☆20Updated last year