MikeWangWZHL / Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆16Updated last year
Related projects: ⓘ
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆21Updated last year
- ☆13Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆22Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆26Updated 3 months ago
- Adding new tasks to T0 without catastrophic forgetting☆30Updated last year
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Updated last year
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆18Updated last year
- Code for our BlackboxNLP'20 paper "BERTnesia: Investigating the capture and forgetting of knowledge in BERT"☆9Updated 3 years ago
- ☆11Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆28Updated 3 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆16Updated last month
- ☆22Updated last year
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆17Updated last year
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Updated last year
- ☆42Updated last year
- ☆24Updated 6 months ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆20Updated last year
- Code for Navigating Connected Memories with a Task-oriented Dialog System☆17Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆15Updated last year
- Repository for Skill Set Optimization☆12Updated last month
- ☆23Updated 2 weeks ago
- ☆14Updated 6 months ago
- Tasks for describing differences between text distributions.☆15Updated last month
- ☆14Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- ☆19Updated last year
- ☆28Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆27Updated this week
- ☆13Updated this week