shaharl6000 / MoreDocsSameLenLinks
This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retrieval-Augmented Generation (RAG) systems.
☆17Updated 8 months ago
Alternatives and similar repositories for MoreDocsSameLen
Users that are interested in MoreDocsSameLen are comparing it to the libraries listed below
Sorting:
- ☆67Updated 8 months ago
- ☆25Updated 2 weeks ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- MEXMA: Token-level objectives improve sentence representations☆42Updated 10 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 7 months ago
- ☆24Updated last year
- ☆20Updated 7 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆51Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆35Updated 3 months ago
- ☆25Updated 2 weeks ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 11 months ago
- Code and data from the paper 'Human Feedback is not Gold Standard'☆19Updated last year
- ☆17Updated 7 months ago
- ☆35Updated 6 months ago
- Aioli: A unified optimization framework for language model data mixing☆31Updated 10 months ago
- ☆49Updated 7 months ago
- ☆41Updated 6 months ago
- UQ: Assessing Language Models on Unsolved Questions☆28Updated 3 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18Updated 7 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated 2 weeks ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆65Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆21Updated 2 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 9 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆24Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs☆20Updated last year
- A massively multilingual modern encoder language model☆113Updated last month
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆14Updated last month