ServiceNow / drbenchLinks
An enterprise deep research benchmark
☆25Updated 3 weeks ago
Alternatives and similar repositories for drbench
Users that are interested in drbench are comparing it to the libraries listed below
Sorting:
- ☆79Updated last year
- ☆36Updated 4 months ago
- Repository for paper Decrypting Cryptic Crosswords☆10Updated 3 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- Inquisitive Parrots for Search☆198Updated 5 months ago
- Ensembling Hugging Face transformers made easy☆62Updated 2 years ago
- Synthetic Data Generation for Evaluation☆13Updated 9 months ago
- Repository for research in the field of Responsible NLP at Meta.☆202Updated 6 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆218Updated last year
- ☆55Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 4 months ago
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆12Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆30Updated last month
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 3 years ago
- Benchmarking Large Language Models☆101Updated 5 months ago
- ☆84Updated 2 years ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Updated 3 years ago
- Utilities for the HuggingFace transformers library☆72Updated 2 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆53Updated 3 months ago
- Code repository for the NAACL 2022 paper "ExSum: From Local Explanations to Model Understanding"☆64Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- ☆29Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆97Updated 2 years ago
- ☆44Updated 2 years ago
- A diff tool for language models☆44Updated last year
- ☆54Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆34Updated 5 months ago