nreimers / se-benchmark
☆9Updated 3 years ago
Alternatives and similar repositories for se-benchmark:
Users that are interested in se-benchmark are comparing it to the libraries listed below
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆87Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 2 years ago
- Code and Data for Evaluation WG☆41Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Dense hybrid representations for text retrieval☆62Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆135Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆82Updated 4 months ago
- ☆75Updated 3 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Updated 2 years ago
- ☆16Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆174Updated 2 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 2 years ago
- ☆21Updated 3 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆48Updated 2 years ago
- ☆29Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 10 months ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆169Updated 3 years ago
- ☆97Updated 2 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆23Updated 2 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆38Updated last year
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆75Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- ☆55Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆153Updated last year