ServiceNow / drbenchLinks
An enterprise deep research benchmark
☆29Updated 2 months ago
Alternatives and similar repositories for drbench
Users that are interested in drbench are comparing it to the libraries listed below
Sorting:
- ☆37Updated 5 months ago
- Ensembling Hugging Face transformers made easy☆61Updated 3 years ago
- Repository for paper Decrypting Cryptic Crosswords☆10Updated 4 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Updated last year
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Updated 10 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆56Updated 5 months ago
- Repository for research in the field of Responsible NLP at Meta.☆204Updated 8 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Updated last year
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Updated 2 years ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆35Updated 7 months ago
- ☆56Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆215Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆50Updated 3 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆190Updated 6 months ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆24Updated 2 years ago
- ☆29Updated last year
- ☆42Updated last year
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- ☆80Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆112Updated last year
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 3 years ago
- Utilities for the HuggingFace transformers library☆74Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- ☆44Updated 2 years ago
- ☆44Updated 4 years ago
- ☆14Updated 11 months ago
- ☆102Updated 3 years ago
- ☆44Updated last year