To assess long-text capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact retrieval to more challenging single-document and multi-document question answering tasks.
☆13Mar 4, 2024Updated 2 years ago
Alternatives and similar repositories for NeedleInAHaystack-PLUS
Users that are interested in NeedleInAHaystack-PLUS are comparing it to the libraries listed below.
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- Agentic Keyframe Search for Video Question Answering☆17Apr 7, 2025Updated last year
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- This is the official repo for our paper: "Generative Knowledge-Guided Retrieval System for Construction Disclosure Documents Reviewing"☆22Nov 17, 2025Updated 4 months ago
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking☆43Dec 12, 2025Updated 3 months ago
- ☆10Feb 17, 2024Updated 2 years ago
- This is a repository corresponding to the ACCV 2022 accepted paper "Complex Handwriting Trajectory Recovery: Evaluation Metrics and Algorithm"…☆13Oct 3, 2022Updated 3 years ago
- ☆14Dec 19, 2024Updated last year
- ☆17Jun 14, 2023Updated 2 years ago
- A Rust implementation of Yolo for object detection and tracking.☆10Nov 17, 2022Updated 3 years ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated 2 years ago
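- A navigation page that integrates multiple search engines and commonly used sites for entertainment, research, tools, and daily life☆13May 30, 2025Updated 10 months ago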
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆113Feb 20, 2025Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 5 months ago
- ☆10Mar 24, 2023Updated 3 years ago
- Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network, ICPR 2018.☆15Jul 17, 2019Updated 6 years ago
- ☆26Feb 6, 2023Updated 3 years ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆28Mar 18, 2026Updated 3 weeks ago
- [COLING2020] A challenge dataset for Person SenTiment analysis in news domain.☆11May 2, 2022Updated 3 years ago
- Notebook on computer science, linguistics, deep learning, opinions, and life. Making it bilingual (en/zh).☆14Feb 13, 2026Updated last month
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- [NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis☆25Nov 28, 2024Updated last year
- ☆15Mar 26, 2024Updated 2 years ago
- Goal: build a small, elegant, more linguistically principled llama tokenizer that supports Chinese, English, and Japanese☆20Jun 2, 2024Updated last year
- .NET style property grid, written in Plain JavaScript as Web component☆13Mar 4, 2023Updated 3 years ago
- Example implementation of the Proximal Policy Optimization algorithm☆17Jul 25, 2024Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆14Mar 11, 2025Updated last year
- Shen Zhou, Tieyun Qian: On the Strength of Sequence Labeling and Generative Models for Aspect Sentiment Triplet Extraction. Findings of A…☆12May 26, 2023Updated 2 years ago
- ☆12Apr 29, 2024Updated last year
- An OpenAI-compatible API that integrates LLM, Embedding, and Reranker☆18Aug 21, 2025Updated 7 months ago
- ☆16Aug 23, 2023Updated 2 years ago
- CODA: Repurposing Continuous VAEs for Discrete Tokenization☆35Jul 4, 2025Updated 9 months ago
- Demo for Neuro-Symbolic Agent (LOA)☆17Sep 27, 2022Updated 3 years ago
- Example of running MDX on Druid via Mondrian and Calcite☆26Aug 3, 2016Updated 9 years ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality☆20Oct 22, 2025Updated 5 months ago