Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
Alternatives and similar repositories for nanoColBERT:
Users that are interested in nanoColBERT are comparing it to the libraries listed below
- The first dense retrieval model that can be prompted like an LM☆68Updated 6 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆89Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap☆84Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆103Updated 4 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆100Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- ☆47Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆122Updated 5 months ago
- ☆57Updated 6 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated this week
- ☆67Updated 7 months ago
- ☆40Updated 2 months ago
- ☆48Updated 5 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆49Updated 9 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated 2 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆79Updated 4 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆132Updated 5 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated 2 months ago
- This is the official repository for Inheritune.☆111Updated 2 months ago
- Pre-train Static Word Embeddings☆53Updated this week
- ☆119Updated 6 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆53Updated 4 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆42Updated last year
- Supercharge huggingface transformers with model parallelism.☆76Updated 6 months ago