bailuding / rails
Efficient Retrieval with Learned Similarities
☆11 Updated last month
Related projects:
- [NeurIPS 2023] Model-enhanced Vector Index ☆21 Updated 4 months ago
- Transformer-based Realtime User Action Model for Recommendation at Pinterest ☆49 Updated last year
- Code of Paper "ReLLa: Retrieval-enhanced Large Language Models for Mitigating Long Context Problems in Recommendation". ☆33 Updated 6 months ago
- Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval ☆15 Updated 2 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces ☆11 Updated last year
- JDsearch: A Personalized Product Search Dataset with Real Queries and Full Interactions ☆27 Updated last year
- Language Models as Semantic Indexers (ICML 2024) ☆19 Updated 4 months ago
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data. ☆22 Updated last year
- Recommender systems with large language models (Paper list) ☆57 Updated 10 months ago
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting ☆60 Updated 6 months ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval ☆115 Updated last month
- Differentiable Product Quantization for End-to-End Embedding Compression. ☆58 Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval ☆20 Updated 2 months ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup. ☆49 Updated 2 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models ☆42 Updated last week
- This is the official PyTorch implementation for the paper: "Directed Acyclic Graph Factorization Machines for CTR Prediction via Knowledg… ☆13 Updated last year
- The code for the paper "RAT: Retrieval-Augmented Transformer for Click-Through Rate Prediction" (WWW 24 short paper) ☆25 Updated 6 months ago
- Unbiased Learning To Rank Algorithms (ULTRA) ☆93 Updated last year
- A repository sharing the literatures about large language models ☆19 Updated last month
- Source code of CIKM 2022 and DLP-KDD workshop 2022 Best Paper: IntTower-“ IntTower: the Next Generation of Two-Tower Model for Pre-rankin… ☆58 Updated last year