☆43Apr 22, 2025Updated 10 months ago
Alternatives and similar repositories for REBEL
Users that are interested in REBEL are comparing it to the libraries listed below
Sorting:
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 7 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆20Feb 23, 2026Updated last week
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 5 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- ☆11Feb 25, 2025Updated last year
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- A curated list of awesome papers about utilizing large language models for ranking.☆31Oct 30, 2024Updated last year
- ☆14Jul 7, 2024Updated last year
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆13Jul 5, 2017Updated 8 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- ☆37May 5, 2025Updated 9 months ago
- Morpha lex stemmer converted using jflex.☆24Oct 12, 2020Updated 5 years ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Jun 28, 2025Updated 8 months ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated 10 months ago
- An introduction to DSPy☆34Aug 30, 2025Updated 6 months ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆18Mar 13, 2025Updated 11 months ago
- ☆19May 16, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- A fully integrated Django x Next.js tutorial on Google Docs☆17Mar 10, 2025Updated 11 months ago
- Efficient BM25 with DuckDB 🦆☆64Dec 20, 2024Updated last year
- Model implementation for the contextual embeddings project☆40Jun 2, 2025Updated 9 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 10 months ago
- ☆19Jan 3, 2025Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆27Mar 2, 2025Updated last year
- GLiNER model in a FastAPI microservice.☆47Dec 11, 2024Updated last year
- Retrieval-Augmented Generation battle!☆62Jul 31, 2025Updated 7 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆39Feb 7, 2026Updated 3 weeks ago
- Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆39Aug 13, 2025Updated 6 months ago
- Benchmarking library for RAG☆260Feb 15, 2026Updated 2 weeks ago
- Late Interaction Models Training & Retrieval☆732Updated this week
- Query Expension for Better Query Embedding using LLMs☆67Feb 18, 2025Updated last year
- Code for our EMNLP 2020 paper "Uncertainty-Aware Label Refinement for Sequence Labeling"☆22Oct 4, 2020Updated 5 years ago
- PyLate efficient inference engine☆73Jan 7, 2026Updated last month
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago