huggingface / datasets-viewer
Viewer for the 🤗 datasets library.
☆83 · Updated 3 years ago
Related projects:
- ☆73 · Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆131 · Updated last year
- LM Pretraining with PyTorch/TPU ☆131 · Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆49 · Updated 3 years ago
- QED: A Framework and Dataset for Explanations in Question Answering ☆114 · Updated 3 years ago
- Hyperparameter Search for AllenNLP ☆134 · Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆152 · Updated 8 months ago
- ☆86 · Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆120 · Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆146 · Updated 3 years ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them" ☆200 · Updated 3 years ago
- Code and Data for Evaluation WG ☆41 · Updated 2 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis. ☆105 · Updated 3 years ago
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering" ☆119 · Updated last year
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers ☆47 · Updated last year
- State of the art Semantic Sentence Embeddings ☆97 · Updated 2 years ago
- On Generating Extended Summaries of Long Documents ☆77 · Updated 3 years ago
- ☆46 · Updated 4 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP ☆107 · Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX ☆27 · Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆32 · Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆188 · Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆63 · Updated last year
- A 🤗-style implementation of BERT using lambda layers instead of self-attention ☆70 · Updated 3 years ago
- ☆29 · Updated 2 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions" ☆116 · Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset. ☆91 · Updated last year
- Implementation of Marge, Pre-training via Paraphrasing, in PyTorch ☆75 · Updated 3 years ago
- Fine-tune transformers with pytorch-lightning ☆44 · Updated 2 years ago
- A benchmark for code-switched NLP, ACL 2020 ☆74 · Updated 3 months ago