The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper
☆70May 14, 2023Updated 3 years ago
Alternatives and similar repositories for SLED
Users that are interested in SLED are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Mar 26, 2024Updated 2 years ago
- ☆11Jul 15, 2020Updated 5 years ago
- 문서 요약 논문 정리☆15Oct 27, 2021Updated 4 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Apr 11, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Jun 5, 2024Updated last year
- FairSeq repo with Apollo optimizer☆113Dec 20, 2023Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- ☆19Apr 10, 2024Updated 2 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- Query-focused summarization data☆44Feb 17, 2023Updated 3 years ago
- ☆12Jun 23, 2023Updated 2 years ago
- Fusion-in-Decoder☆593Oct 4, 2023Updated 2 years ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆130Apr 23, 2022Updated 4 years ago
- ☆14Dec 9, 2021Updated 4 years ago
- Codes for coreference-aware machine reading comprehension☆13Mar 13, 2022Updated 4 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆40Sep 15, 2022Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP 2022] Differentiable Data Augmentation for Contrastive Sentence Representation Learning. https://arxiv.org/abs/2210.16536☆40Nov 1, 2022Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆45Aug 10, 2024Updated last year
- ☆57Mar 27, 2023Updated 3 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 10 months ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆68Jul 4, 2021Updated 4 years ago
- ☆15Jun 2, 2025Updated 11 months ago
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆559Apr 8, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 3 years ago
- ☆26Aug 14, 2022Updated 3 years ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Sep 23, 2022Updated 3 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization☆57Aug 2, 2023Updated 2 years ago
- Code for ACL 2022 paper on the topic of long document summarization: MemSum: Extractive Summarization of Long Documents Using Multi-Step …☆49Jun 19, 2024Updated last year
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- Investigating Cultural Alignment of Large Language Models☆13Aug 14, 2024Updated last year