The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGIR 2022
☆16 May 4, 2022 Updated 3 years ago
Alternatives and similar repositories for CharacterBERT-DR
Users who are interested in CharacterBERT-DR are comparing it to the libraries listed below
- Implementation and results for ICTIR2021 paper: Effective and Privacy-preserving Federated Online Learning to Rank ☆10 Jul 24, 2021 Updated 4 years ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous … ☆31 Apr 24, 2024 Updated last year
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences" ☆14 Sep 9, 2025 Updated 5 months ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations. ☆15 Sep 3, 2024 Updated last year
- ☆16 Dec 14, 2022 Updated 3 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆22 Jun 30, 2025 Updated 8 months ago
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL) ☆16 Jul 27, 2024 Updated last year
- Exploring semantic similarities between contextualized embeddings ☆14 May 18, 2021 Updated 4 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training. ☆27 Aug 10, 2023 Updated 2 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking ☆25 Apr 4, 2025 Updated 11 months ago
- A software for transferring pre-trained English models to foreign languages ☆19 Mar 20, 2023 Updated 2 years ago
- ☆21 Apr 17, 2023 Updated 2 years ago
- Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "… ☆18 Feb 15, 2022 Updated 4 years ago
- Entailment self-training ☆27 May 30, 2023 Updated 2 years ago
- We introduce the direct document relevance optimization (DDRO) for training a pairwise ranker model. DDRO encourages the model to focus o… ☆35 Jan 10, 2026 Updated last month
- An Open-Source Package for Information Retrieval ☆168 Updated this week
- Few-shot Learning with Auxiliary Data ☆31 Dec 8, 2023 Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆28 Oct 3, 2021 Updated 4 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces ☆35 Oct 9, 2023 Updated 2 years ago
- LTG-Bert ☆34 Jan 8, 2024 Updated 2 years ago
- ☆31 Dec 13, 2023 Updated 2 years ago
- ☆30 Sep 25, 2024 Updated last year
- Language Models as Hierarchy Encoders ☆39 Jan 6, 2026 Updated last month
- Collections of IR Research ☆37 May 18, 2025 Updated 9 months ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆40 Sep 15, 2022 Updated 3 years ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware! ☆42 Dec 14, 2022 Updated 3 years ago
- Arabic News Stance Corpus ☆11 Feb 5, 2021 Updated 5 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 Jan 9, 2025 Updated last year
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156 ☆48 Dec 14, 2023 Updated 2 years ago
- ☆14 Feb 25, 2026 Updated last week
- EOSIO-Taurus - The Most Powerful Infrastructure for Decentralized Applications ☆13 Mar 29, 2024 Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆39 Jun 11, 2025 Updated 8 months ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost ☆42 Nov 15, 2023 Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends! ☆12 Jul 18, 2025 Updated 7 months ago
- Temporal summarization framework ☆10 Dec 4, 2023 Updated 2 years ago
- ☆10 Oct 2, 2024 Updated last year
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder ☆10 Mar 16, 2023 Updated 2 years ago
- Not just a PDE toolbox. Adapt your ideas from a clean, modular code base with Femeko. ☆15 Feb 22, 2026 Updated last week
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition) ☆10 Feb 17, 2019 Updated 7 years ago