Viewer for the 🤗 datasets library.
☆86Jul 30, 2021Updated 4 years ago
Alternatives and similar repositories for datasets-viewer
Users that are interested in datasets-viewer are comparing it to the libraries listed below.
- ☆30Sep 27, 2021Updated 4 years ago
- ☆13Mar 27, 2020Updated 5 years ago
- 🔧 Train off-the-shelf machine learning models in one line of code☆12Mar 12, 2021Updated 5 years ago
- Boolean question answering with multi-task learning, using large LM embeddings such as BERT and RoBERTa☆18Aug 30, 2019Updated 6 years ago
- A starter kit for evaluating benchmarks on the 🤗 Hub☆16Dec 29, 2023Updated 2 years ago
- ☆104Jan 14, 2021Updated 5 years ago
- PyTorch code for meta seq2seq learning☆43Jan 14, 2020Updated 6 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆59Apr 6, 2023Updated 2 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers`☆21Jan 8, 2026Updated 2 months ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Jun 3, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- An implementation of BERT using PyTorch's TransformerEncoder☆32Dec 15, 2019Updated 6 years ago
- A Graph-based Pattern Representations Tutorial☆10Jul 15, 2019Updated 6 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- ☆10Jul 13, 2022Updated 3 years ago
- ☆10Feb 2, 2021Updated 5 years ago
- Data from KAIST (a Korean treebank).☆19Nov 12, 2025Updated 4 months ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Apr 28, 2022Updated 3 years ago
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)☆200Jul 6, 2023Updated 2 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- jiant is an NLP toolkit☆1,674Jul 6, 2023Updated 2 years ago
- Interpreting Sarcasm with Sentiment Based Monolingual Machine Translation☆11May 7, 2017Updated 8 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval☆43Jun 12, 2023Updated 2 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Apr 21, 2021Updated 4 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Mar 17, 2020Updated 6 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- Papers & presentation materials from Hugging Face's internal science day☆2,054Oct 31, 2020Updated 5 years ago
- Homework for the Coursera NLP course: https://www.coursera.org/learn/language-processing/home/welcome ☆15Dec 7, 2022Updated 3 years ago
- ☆13Oct 28, 2020Updated 5 years ago
- ✨ Web interface for NeuralCoref coreference resolution☆35May 15, 2023Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- 🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code☆2,825Jun 23, 2023Updated 2 years ago
- Developing tools to automatically analyze datasets☆75Oct 29, 2024Updated last year
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- A queue service for quickly developing scripts that use all your GPUs efficiently☆88Sep 25, 2022Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆317May 28, 2020Updated 5 years ago