Codebase accompanying the Summary of a Haystack paper.
☆82 · Jun 25, 2026 · Updated this week
Alternatives and similar repositories for summary-of-a-haystack
Users that are interested in summary-of-a-haystack are comparing it to the libraries listed below.
- Code Repository for Blog - How to Productionize Large Language Models (LLMs) ☆12 · Mar 27, 2024 · Updated 2 years ago
- BH hackathon ☆14 · Apr 4, 2024 · Updated 2 years ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆154 · Dec 22, 2025 · Updated 6 months ago
- ☆18 · Oct 23, 2024 · Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark ☆234 · Apr 13, 2026 · Updated 2 months ago
- ☆13 · Jul 8, 2020 · Updated 5 years ago
- Splits videos into scenes with gpt-4o-mini and saves them separately ☆12 · Dec 19, 2024 · Updated last year
- Official repo for "Make Your LLM Fully Utilize the Context" ☆273 · May 15, 2024 · Updated 2 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆22 · Dec 2, 2024 · Updated last year
- Wind power and energy yield forecast ☆11 · Updated this week
- ☆13 · Sep 12, 2024 · Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆18 · Dec 19, 2024 · Updated last year
- Multi-Agent AI App from Scratch in Python without any framework dependency ☆15 · Jan 7, 2025 · Updated last year
- CVPR 2024 Research Paper with Code ☆47 · Jun 28, 2024 · Updated 2 years ago
- ☆15 · Jun 2, 2025 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆110 · Sep 19, 2025 · Updated 9 months ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe… ☆21 · Jan 9, 2025 · Updated last year
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆35 · Aug 24, 2024 · Updated last year
- A tool for calling (and calling out to) large language models. ☆16 · Aug 13, 2024 · Updated last year
- ☆20 · Nov 30, 2021 · Updated 4 years ago
- Image Search Engine with HuggingFace Sentence Transformer ☆12 · Aug 31, 2023 · Updated 2 years ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning" ☆208 · Feb 18, 2026 · Updated 4 months ago
- ☆62 · Jun 2, 2026 · Updated 3 weeks ago
- PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24) ☆63 · Apr 18, 2024 · Updated 2 years ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆76 · Dec 25, 2024 · Updated last year
- IBM Quantum Challenge Fall 2023 ☆10 · May 23, 2023 · Updated 3 years ago
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- Includes examples on how to evaluate LLMs ☆23 · Nov 4, 2024 · Updated last year
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆156 · Apr 7, 2025 · Updated last year
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆71 · May 13, 2025 · Updated last year
- Rotrics Code ☆10 · Mar 21, 2021 · Updated 5 years ago
- ☆24 · Feb 5, 2026 · Updated 4 months ago
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations ☆13 · Sep 11, 2024 · Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated 2 years ago
- Open source library for few shot NLP ☆79 · Updated this week
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy. ☆294 · Jun 26, 2025 · Updated last year
- This repository contains resources, documentation and artifacts describing LLM agents ☆15 · Jan 22, 2025 · Updated last year
- Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors ☆17 · Jun 7, 2023 · Updated 3 years ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination" ☆30 · Dec 18, 2024 · Updated last year