UKPLab / acl2024-triple-encoders
triple-encoders is a library for contextualizing distributed Sentence Transformers representations.
☆13 · Updated 5 months ago
Alternatives and similar repositories for acl2024-triple-encoders:
Users who are interested in acl2024-triple-encoders compare it to the libraries listed below:
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆47 · Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆56 · Updated 8 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆74 · Updated 2 years ago
- A Python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features. ☆26 · Updated 5 months ago
- Semantically Structured Sentence Embeddings ☆66 · Updated 3 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆40 · Updated last year
- Dense hybrid representations for text retrieval ☆62 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆93 · Updated 2 years ago
- A Python Commonsense Knowledge Inference Toolkit ☆63 · Updated last year
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks ☆63 · Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆102 · Updated last year
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆136 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆17 · Updated last week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆104 · Updated 9 months ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning ☆14 · Updated last year
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning ☆29 · Updated 2 years ago
- Cross language information retrieval pipeline ☆18 · Updated last year
- A multilingual version of MS MARCO passage ranking dataset ☆143 · Updated last year
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures. ☆31 · Updated 9 months ago