Batteries-included eval framework for search APIs
☆225Jun 2, 2026Updated last week
Alternatives and similar repositories for search_evals
Users that are interested in search_evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆16Jun 1, 2026Updated last week
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 9 months ago
- Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]☆33Apr 2, 2026Updated 2 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆40May 15, 2024Updated 2 years ago
- ☆13Sep 7, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆42Oct 17, 2025Updated 7 months ago
- Repository that hold the information about the competition about validating and fixing relationship direction in Cypher statements based …☆27Sep 18, 2023Updated 2 years ago
- ☆29Apr 30, 2026Updated last month
- This sample shows how to build vector similarity search on Azure Cosmos DB for PostgreSQL using the pgvector extension and the multi-moda…☆11Jul 13, 2024Updated last year
- A collection of Zsh functions to augment Git☆19Dec 11, 2025Updated 5 months ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆90Sep 4, 2025Updated 9 months ago
- Reference architecture, guides, and examples using Amazon Bedrock and Redis as a knowledge base for RAG.☆15Oct 21, 2023Updated 2 years ago
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆30Dec 8, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆80Dec 8, 2025Updated 6 months ago
- Safe Python Code Execution Environment for Language Models☆17May 19, 2026Updated 3 weeks ago
- Metadata browser of TREC☆10May 19, 2026Updated 3 weeks ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆101Apr 9, 2025Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- Foundry IQ Demo☆35Jan 25, 2026Updated 4 months ago
- ☆12Sep 1, 2023Updated 2 years ago
- Chat with agents 🤖 and see their thoughts 💭☆15Jul 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Multi-domain Benchmark for Personalized Search Evaluation☆12Sep 7, 2023Updated 2 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Use Azure Cognitive Services with React - Face API as an example☆10Mar 3, 2023Updated 3 years ago
- ☆14Apr 23, 2025Updated last year
- ☆12Apr 30, 2019Updated 7 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆39Oct 1, 2025Updated 8 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆17Dec 19, 2024Updated last year
- Dense hybrid representations for text retrieval☆65Apr 3, 2023Updated 3 years ago
- ☆25Mar 28, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- Using Azure OpenAI GPT 4o to extract information such as text, tables and charts from Documents to Markdown☆28Jan 26, 2025Updated last year
- ☆10Mar 11, 2024Updated 2 years ago
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated 4 months ago
- Jig for the Open-Source IR Replicability Challenge (OSIRRC)☆13Dec 8, 2022Updated 3 years ago
- ☆13Mar 9, 2024Updated 2 years ago
- Indexing Sharepoint Online Content to Azure Cognitive Search☆37Nov 15, 2024Updated last year