microsoft / lost_in_conversationLinks
Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)
☆141Updated 2 weeks ago
Alternatives and similar repositories for lost_in_conversation
Users that are interested in lost_in_conversation are comparing it to the libraries listed below
Sorting:
- Complex Function Calling Benchmark.☆117Updated 5 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆115Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆190Updated 3 weeks ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆174Updated 2 weeks ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆138Updated 8 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆95Updated last month
- ☆160Updated 2 weeks ago
- The first dense retrieval model that can be prompted like an LM☆80Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 9 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆209Updated this week
- ☆104Updated 2 months ago
- ☆69Updated last month
- ☆89Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆94Updated 2 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆168Updated 6 months ago
- Verifiers for LLM Reinforcement Learning☆64Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 5 months ago
- ☆122Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)☆127Updated 2 months ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]☆114Updated 5 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆91Updated last month
- ☆124Updated 9 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆229Updated 8 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆206Updated last month
- Reproducible, flexible LLM evaluations☆215Updated 2 months ago
- Large language models for document ranking.☆60Updated 2 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆228Updated 8 months ago
- ☆55Updated this week