shaharl6000 / MoreDocsSameLen
This repository contains the code and datasets for our paper on how the number of retrieved documents affects Retrieval-Augmented Generation (RAG) systems when the context size is held fixed.
☆14 Updated last month
Alternatives and similar repositories for MoreDocsSameLen:
Users who are interested in MoreDocsSameLen are comparing it to the repositories listed below:
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Updated last year
- ☆24 Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining. ☆16 Updated 2 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 Updated this week
- ☆15 Updated 2 weeks ago
- Official Repository for Task-Circuit Quantization ☆15 Updated 2 weeks ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 Updated 7 months ago
- The official implementation of Preference Data Reward-Augmentation. ☆17 Updated 6 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits" ☆13 Updated 6 months ago
- ☆62 Updated 3 weeks ago
- Aioli: A unified optimization framework for language model data mixing ☆23 Updated 3 months ago
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆29 Updated 3 weeks ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile. ☆15 Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆17 Updated last year
- ☆16 Updated 9 months ago
- MEXMA: Token-level objectives improve sentence representations ☆40 Updated 3 months ago
- ☆13 Updated 4 months ago
- ☆21 Updated last month
- ☆19 Updated 2 weeks ago
- ☆20 Updated last month
- ☆21 Updated 6 months ago
- ☆15 Updated 2 weeks ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning" ☆12 Updated 5 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆25 Updated 4 months ago
- Control LLM ☆14 Updated 3 weeks ago
- Repository for Skill Set Optimization ☆12 Updated 9 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models" ☆27 Updated 2 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 2 months ago