The unified platform for data-related resources.
☆135Mar 6, 2023Updated 2 years ago
Alternatives and similar repositories for DataLab
Users that are interested in DataLab are comparing it to the libraries listed below
Sorting:
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 3 years ago
- Knowledge Graph Simple Question Answering for Unseen Domains☆13Jul 2, 2025Updated 7 months ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆14Aug 23, 2022Updated 3 years ago
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14May 6, 2023Updated 2 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆367Jun 27, 2022Updated 3 years ago
- GSum: A General Framework for Guided Neural Abstractive Summarization☆116Sep 22, 2025Updated 5 months ago
- ☆17May 19, 2023Updated 2 years ago
- ☆37Sep 22, 2021Updated 4 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆17Apr 19, 2024Updated last year
- ☆23Oct 30, 2023Updated 2 years ago
- ☆39Jun 7, 2023Updated 2 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Nov 23, 2022Updated 3 years ago
- This is a repository for the Travatar forest-to-string translation decoder☆29Aug 7, 2021Updated 4 years ago
- This is the repo for the paper "Revealing the Importance of Semantic Retrieval for Machine Reading at Scale".☆60Nov 25, 2019Updated 6 years ago
- Mastering-NLP-from-Foundations-to-LLMs☆10Apr 11, 2025Updated 10 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- 한국어 소설 텍스트를 위한 자연어처리 라이브러리입니다. Natural Language Processing Library for Korean Literary Text. (Will be open in February, 2024)☆11Jan 16, 2024Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆93Jul 12, 2022Updated 3 years ago
- The source code of paper "Semi-supervised Relation Extraction via Incremental Meta Self-Training"☆22Oct 7, 2020Updated 5 years ago
- Code for paper Document-Level Paraphrase Generation with Sentence Rewriting and Reordering by Zhe Lin, Yitao Cai and Xiaojun Wan. This pa…☆26Nov 10, 2021Updated 4 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45May 13, 2024Updated last year
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- Annotated Enron Subject Line Corpus (AESLC)☆25Feb 2, 2023Updated 3 years ago
- ROUGE for multilingual Summarization☆25Oct 11, 2021Updated 4 years ago
- ☆10Apr 20, 2016Updated 9 years ago
- Models for explainable recommendation.☆12Jan 19, 2024Updated 2 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆10Apr 14, 2025Updated 10 months ago
- ☆15Nov 22, 2023Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago