UKPLab / acl2024-triple-encodersLinks

triple-encoders is a library for contextualizing distributed Sentence Transformers representations.

☆15

Alternatives and similar repositories for acl2024-triple-encoders

Users that are interested in acl2024-triple-encoders are comparing it to the libraries listed below

Sorting:

g8a9 / ear
Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"
☆50Updated 3 years ago
smallbenchnlp / ELECTRA-DeBERTa
☆16Updated 2 years ago
Mivg / SLED
The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper
☆70Updated 2 years ago
bigscience-workshop / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆105Updated 2 years ago
sophiaalthammer / parm
This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…
☆41Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96Updated 2 years ago
amazon-science / efficient-longdoc-classification
☆47Updated 3 years ago
huggingface / that_is_good_data
☆65Updated 2 years ago
sebastian-hofstaetter / colberter
☆46Updated 3 years ago
konstantinjdobler / focus
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
☆34Updated 5 months ago
castorini / mr.tydi
Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.
☆79Updated 3 years ago
kayoyin / interpret-lm
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆62Updated 3 years ago
terrierteam / pyterrier_colbert
☆87Updated 7 months ago
amazon-science / mintaka
Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)
☆116Updated 3 years ago
allenai / open-mds
The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …
☆32Updated 2 years ago
katzurik / NERetrieve
☆30Updated last year
jjzha / cartography-al
Code base for the EMNLP 2021 Findings paper: Cartography Active Learning
☆14Updated 5 months ago
castorini / dhr
Dense hybrid representations for text retrieval
☆63Updated 2 years ago
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆189Updated 4 years ago
oriram / spider
☆54Updated 2 years ago
DFKI-NLP / thermostat
Collection of NLP model explanations and accompanying analysis tools
☆144Updated 2 years ago
allenai / PRIMER
The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
☆157Updated 3 years ago
flipz357 / S3BERT
Semantically Structured Sentence Embeddings
☆69Updated last year
Kaleidophon / awesome-experimental-standards-deep-learning
Repository collecting resources and best practices to improve experimental rigour in deep learning research.
☆27Updated 2 years ago
nyu-mll / SQuALITY
Query-focused summarization data
☆42Updated 2 years ago
salesforce / query-focused-sum
Official code repository for "Exploring Neural Models for Query-Focused Summarization".
☆50Updated 2 years ago
terrierteam / pyterrier_doc2query
☆37Updated 3 weeks ago
google-research-datasets / PropSegmEnt
PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…
☆21Updated 2 years ago
zouharvi / tokenization-scorer
Simple-to-use scoring function for arbitrarily tokenized texts.
☆47Updated 9 months ago
cambridgeltl / composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Updated last year