microsoft / CyBERTron-LM
CyBERTron-LM is a project which collects some pre-trained Transformer-based models.
☆12Updated last year
Alternatives and similar repositories for CyBERTron-LM:
Users that are interested in CyBERTron-LM are comparing it to the libraries listed below
- ☆14Updated last year
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization☆29Updated 2 years ago
- ☆22Updated last year
- We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…☆38Updated 2 years ago
- Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021)☆9Updated 4 years ago
- Generative Retrieval Transformer☆28Updated last year
- Fault-aware neural code rankers☆28Updated 2 years ago
- ☆33Updated 2 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆60Updated last year
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆17Updated last year
- Boosting Natural Language Generation from Instructions with Meta-Learning☆10Updated 2 years ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- ☆19Updated 5 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆18Updated last year
- Renee: End-to-end training of extreme classification models☆21Updated last year
- Code for paper "Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling"☆7Updated 5 years ago
- Unifew: Unified Fewshot Learning Model☆18Updated 3 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering☆32Updated last year
- Scripts to parse arxiv documents for NLP tasks☆17Updated last year
- This repository contains the dataset and the pytorch implementations of the models from the paper CIDER: Commonsense Inference for Dialog…☆27Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated 2 years ago
- ☆27Updated last year
- RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge☆15Updated 3 years ago
- Knowledge Computing group - MSRA☆86Updated last year
- ☆42Updated 5 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆46Updated 8 months ago
- Website for TextVQA dataset.☆28Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated 10 months ago
- ☆45Updated 2 years ago