microsoft / ARXGEN
Scripts to parse arxiv documents for NLP tasks
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ARXGEN
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆17Updated last year
- Generative Retrieval Transformer☆29Updated last year
- Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021)☆9Updated 3 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Bi-Directional Attention Flow for Machine Comprehensions☆10Updated 6 years ago
- A simple semantic search engine for scientific papers.☆27Updated last year
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆13Updated last year
- PyTorch library for synthesizing programs from natural language☆18Updated 3 months ago
- CyBERTron-LM is a project which collects some pre-trained Transformer-based models.☆12Updated last year
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering☆32Updated last year
- ☆14Updated 3 years ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models toUnderstand The Flow of Events"☆11Updated 3 years ago
- ☆17Updated last year
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- Repository for Skill Set Optimization☆12Updated 3 months ago
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web.☆22Updated 3 years ago
- A curated list of papers exploring the limits of deep learning for NLP☆23Updated 6 years ago
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Updated 2 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆18Updated last year
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated last year
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"☆29Updated last year
- Unifew: Unified Fewshot Learning Model☆18Updated 3 years ago
- ☆12Updated 3 years ago
- Resources for "Conversational Entity Linking: Problem Definition and Datasets"☆18Updated last year
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Updated 3 years ago