fau-masters-collected-works-cgarbin / datasheet-for-dataset-template
Template for datasheet for datasets
☆24Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for datasheet-for-dataset-template
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated last year
- Biomedical Data-to-Text Generation via Fine-Tuning Transformers☆29Updated 2 years ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆19Updated 4 months ago
- ☆30Updated 4 years ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- Intelligence Task Ontology (ITO)☆73Updated 2 years ago
- multimodal document analysis☆160Updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated last month
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering☆38Updated 2 months ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆88Updated 2 years ago
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- Self-verification for LLMs.☆62Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆40Updated 3 years ago
- Course for Interpreting ML Models☆52Updated last year
- ☆22Updated last year
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- A public repo that contains integrations for Argilla and LlamaIndex.☆12Updated last month
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆27Updated last year
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- ☆46Updated 9 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆57Updated 6 months ago
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.☆33Updated last year
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounters’ dialogue transcript and clinical note…☆30Updated last year
- Platform enabling Rapid Annotation for Clinical Entity Recognition☆50Updated 2 years ago
- ☆21Updated this week
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.☆46Updated 9 months ago
- ☆68Updated 8 months ago
- This repository contains the PLOD Dataset for Abbreviation Detection released with our LREC 2022 publication☆11Updated 2 years ago