Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)
☆22Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for LED
Users that are interested in LED are comparing it to the libraries listed below
Sorting:
- Source code of paper 'Open Hierarchical Relation Extraction' (NAACL 2021)☆22Mar 4, 2022Updated 4 years ago
- Demo for advanced Java final project in 18-19 1 of Canghong Jin☆25Nov 18, 2018Updated 7 years ago
- Must-read papers on Fine-grained Entity Typing☆19Jul 7, 2022Updated 3 years ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks☆34Aug 31, 2022Updated 3 years ago
- 🌸 A note-taking web app designed to keep track of your daily to-do and work schedule.☆29Apr 18, 2023Updated 2 years ago
- ☆21Apr 17, 2023Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 11 months ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 4 years ago
- Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval☆16Mar 1, 2022Updated 4 years ago
- Hierarchical entity typing via multi-level learning to rank☆12Oct 13, 2020Updated 5 years ago
- ☆43Aug 15, 2023Updated 2 years ago
- SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction☆27Nov 8, 2022Updated 3 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- [TMLR'26] UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models☆54Mar 10, 2026Updated last week
- Hybrid List Aware Transformer Reranking☆19Oct 25, 2022Updated 3 years ago
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"☆17Mar 29, 2024Updated last year
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 5 months ago
- Submission archive for the MS MARCO document ranking leaderboard☆31Oct 9, 2023Updated 2 years ago
- Dual Cross Encoder for Dense Retrieval☆17Mar 15, 2023Updated 3 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆27Feb 4, 2023Updated 3 years ago
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨☆508Apr 4, 2024Updated last year
- This is the official code for the EMNLP 2023 paper "GLEN: Generative Retrieval via Lexical Index Learning".☆29Aug 25, 2025Updated 6 months ago
- ☆11May 24, 2024Updated last year
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆120Aug 7, 2024Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Dec 29, 2024Updated last year
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 8 months ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- Will be updated continuously.☆38Mar 2, 2022Updated 4 years ago
- ☆80Jan 23, 2023Updated 3 years ago
- The source code for paper--MORE: A Metric learning based framework for Open-domain Relation Extraction.☆12Jan 15, 2021Updated 5 years ago
- Simple ChatGPT interface for shell and macOS Alfred workflow☆13Oct 3, 2025Updated 5 months ago
- Script to train a German n-gram Language Model on articles of Wikipedia☆14Oct 20, 2018Updated 7 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆734Jan 26, 2026Updated last month
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year