fau-masters-collected-works-cgarbin / datasheet-for-dataset-template
Template for datasheet for datasets
☆24 · Updated 2 years ago
Alternatives and similar repositories for datasheet-for-dataset-template:
Users who are interested in datasheet-for-dataset-template are comparing it to the repositories listed below.
- A BERT-based application for reusable text classification at scale ☆37 · Updated last year
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4" ☆68 · Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Updated 4 months ago
- Libraries, Archives and Museums (LAM) ☆82 · Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 · Updated 2 years ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare… ☆18 · Updated 10 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆30 · Updated 5 months ago
- Ranking of fine-tuned HF models as base models. ☆35 · Updated last year
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03. ☆23 · Updated 7 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 · Updated 3 years ago
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering ☆38 · Updated 5 months ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 · Updated 3 years ago
- Tools for managing datasets for governance and training. ☆82 · Updated 2 weeks ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆163 · Updated 8 months ago
- A Python Natural Language Processing Toolkit for Medical Text Generation ☆76 · Updated 3 months ago
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping. ☆35 · Updated last month
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounter’s dialogue transcript and clinical note… ☆32 · Updated last year
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆63 · Updated last year
- Course for Interpreting ML Models ☆52 · Updated 2 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper. ☆12 · Updated last year
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆44 · Updated this week
- 💫 SpaCy wrapper for ConceptNet 💫 ☆89 · Updated last year
- An open-source compliance-centered evaluation framework for Generative AI models ☆131 · Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆74 · Updated 5 months ago