staoxiao / RetroMAE
Codebase for RetroMAE and beyond.
☆249Updated 8 months ago
Alternatives and similar repositories for RetroMAE:
Users that are interested in RetroMAE are comparing it to the libraries listed below
- Build Text Rerankers with Deep Language Models☆258Updated last year
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆153Updated last year
- ☆209Updated 2 years ago
- An Open-Source Package for Information Retrieval☆160Updated 2 weeks ago
- ☆162Updated last year
- Zero-shot Document Ranking with Large Language Models.☆109Updated 7 months ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking☆67Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆244Updated 2 years ago
- A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.☆312Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆218Updated 11 months ago
- Tevatron - A flexible toolkit for neural retrieval research and development.☆561Updated this week
- A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"☆169Updated 2 years ago
- NAACL2021 - COIL Contextualized Lexical Retriever☆152Updated 3 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆100Updated 2 years ago
- PromptBERT: Improving BERT Sentence Embeddings with Prompts☆334Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆307Updated last year
- SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.☆131Updated 3 years ago
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆135Updated last year
- ☆271Updated last year
- ☆105Updated last year
- A toolkit for building dense retrievers with deep language models.☆57Updated 3 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆197Updated last year
- Scalable training for dense retrieval models.☆275Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆240Updated last year
- https://acl2023-retrieval-lm.github.io/☆153Updated last year
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks☆367Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆88Updated 10 months ago
- SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.☆111Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆166Updated last year
- The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shen…☆118Updated last year