microsoft / ARXGEN
Scripts to parse arxiv documents for NLP tasks
☆17Updated last year
Alternatives and similar repositories for ARXGEN:
Users that are interested in ARXGEN are comparing it to the libraries listed below
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated this week
- Generative Retrieval Transformer☆28Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Hugging Face and Pyserini interoperability☆20Updated last year
- Bi-Directional Attention Flow for Machine Comprehensions☆9Updated 7 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- ☆16Updated last year
- ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval☆15Updated 3 years ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆17Updated last year
- Official demo repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies".☆20Updated 4 years ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- A simple semantic search engine for scientific papers.☆27Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated last year
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models toUnderstand The Flow of Events"☆11Updated 3 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10Updated 2 years ago
- ☆34Updated last year
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Updated 4 years ago
- A generic library for crafting adversarial NLP examples - WIP☆40Updated 6 years ago
- Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021)☆9Updated 3 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆13Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆27Updated 5 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆23Updated 2 years ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆19Updated 10 months ago
- Evaluating Machines by their Real-World Language Use☆33Updated last year
- ☆16Updated 5 months ago