Viewer for the π€ datasets library.
β86Jul 30, 2021Updated 4 years ago
Alternatives and similar repositories for datasets-viewer
Users that are interested in datasets-viewer are comparing it to the libraries listed below
Sorting:
- Evaluate Transformers from the Hub π₯β14Nov 27, 2023Updated 2 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTaβ18Aug 30, 2019Updated 6 years ago
- β10Jul 13, 2022Updated 3 years ago
- β13Mar 27, 2020Updated 5 years ago
- β104Jan 14, 2021Updated 5 years ago
- **ARCHIVED** Filesystem interface to π€ Hubβ59Apr 6, 2023Updated 2 years ago
- An implementation of BERT using PyTorch's TransformerEncoderβ32Dec 15, 2019Updated 6 years ago
- π§ Train off-the-shelf machine learning models in one line of codeβ12Mar 12, 2021Updated 4 years ago
- A Streamlit app to add structured tags to a dataset cardβ22Jun 30, 2022Updated 3 years ago
- A starter kit for evaluating benchmarks on the π€ Hubβ16Dec 29, 2023Updated 2 years ago
- β99Jul 7, 2020Updated 5 years ago
- Minimal code to train ELMo models in recent versions of TensorFlowβ14Apr 30, 2023Updated 2 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and Englishβ40Apr 28, 2022Updated 3 years ago
- A web application that generates stories based on genres. Created by fine-tuning GPT2 on genre-based stories.β14Apr 14, 2021Updated 4 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrievalβ43Jun 12, 2023Updated 2 years ago
- PyTorch code for meta seq2seq learningβ43Jan 14, 2020Updated 6 years ago
- Official code for the paper: "Metadata Archaeology"β19May 10, 2023Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in cβ¦β359Feb 22, 2022Updated 4 years ago
- jiant is an nlp toolkitβ1,674Jul 6, 2023Updated 2 years ago
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)β36Jul 22, 2021Updated 4 years ago
- β10Feb 2, 2021Updated 5 years ago
- Data from KAIST (a Korean treebank).β19Nov 12, 2025Updated 3 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.β86Apr 21, 2021Updated 4 years ago
- β12Aug 15, 2023Updated 2 years ago
- New datasetβ311Aug 31, 2021Updated 4 years ago
- Minimal module for computing audio spectrogramsβ15Feb 28, 2019Updated 7 years ago
- β12Nov 30, 2022Updated 3 years ago
- β11Aug 12, 2020Updated 5 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"β25Nov 2, 2021Updated 4 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).β44Dec 25, 2022Updated 3 years ago
- Papers & presentation materials from Hugging Face's internal science dayβ2,052Oct 31, 2020Updated 5 years ago
- β76Oct 25, 2021Updated 4 years ago
- Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/β251Feb 5, 2024Updated 2 years ago
- β221Jun 8, 2020Updated 5 years ago
- A queue service for quickly developing scripts that use all your GPUs efficientlyβ88Sep 25, 2022Updated 3 years ago
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,402Nov 7, 2025Updated 3 months ago
- Fast, general, and tested differentiable structured prediction in PyTorchβ1,123Apr 20, 2022Updated 3 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhiβ¦β49Apr 26, 2021Updated 4 years ago
- A Translation Task using TurboTransformersβ11Dec 17, 2020Updated 5 years ago