Batteries-included eval framework for search APIs
☆239Jun 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for search_evals
Users that are interested in search_evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆17Jun 1, 2026Updated 3 weeks ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 9 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆40May 15, 2024Updated 2 years ago
- ☆14Sep 7, 2024Updated last year
- ☆95Mar 30, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Develop pro-code personal agents integrated with memory service on Teams☆28Jun 22, 2026Updated last week
- ☆21Sep 6, 2021Updated 4 years ago
- This sample shows how to build vector similarity search on Azure Cosmos DB for PostgreSQL using the pgvector extension and the multi-moda…☆11Jul 13, 2024Updated last year
- DDRel: A new dataset for interpersonal relation classification in dyadic dialogues☆23Sep 12, 2021Updated 4 years ago
- A collection of Zsh functions to augment Git☆19Dec 11, 2025Updated 6 months ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated last year
- ☆19Jan 3, 2025Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆30Dec 8, 2025Updated 6 months ago
- Safe Python Code Execution Environment for Language Models☆17Jun 20, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- "TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]☆33Dec 21, 2024Updated last year
- Metadata browser of TREC☆10May 19, 2026Updated last month
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆101Apr 9, 2025Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- collaborative web tool to enrich content☆11Nov 13, 2011Updated 14 years ago
- Foundry IQ Demo☆35Jan 25, 2026Updated 5 months ago
- Generative Reranker PyTerrier☆18Dec 1, 2025Updated 6 months ago
- ☆12Sep 1, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Chat with agents 🤖 and see their thoughts 💭☆15Jul 8, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- CATransformers is a framework for joint neural network and hardware architecture search.☆24Mar 17, 2026Updated 3 months ago
- ☆14Apr 23, 2025Updated last year
- ☆12Apr 30, 2019Updated 7 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆39Oct 1, 2025Updated 8 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆18Dec 19, 2024Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- Dense hybrid representations for text retrieval☆65Apr 3, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated 2 years ago
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated 4 months ago
- [ICLR 2026] JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence☆79May 9, 2026Updated last month
- Indexing Sharepoint Online Content to Azure Cognitive Search☆38Nov 15, 2024Updated last year
- ☆23Jul 23, 2025Updated 11 months ago
- ☆28Apr 19, 2026Updated 2 months ago