Batteries-included eval framework for search APIs
☆218May 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for search_evals
Users that are interested in search_evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]☆33Apr 2, 2026Updated last month
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…☆19Sep 17, 2025Updated 8 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆39May 15, 2024Updated 2 years ago
- ☆12Sep 7, 2024Updated last year
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆40Feb 1, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆89Mar 30, 2026Updated last month
- A sample app for the Retrieval-Augmented Generation pattern using LlamaIndex.ts, running in Azure, using Azure AI Search for retrieval an…☆14Jan 29, 2026Updated 3 months ago
- Develop pro-code personal agents integrated with memory service on Teams☆28May 12, 2026Updated last week
- Repository that hold the information about the competition about validating and fixing relationship direction in Cypher statements based …☆27Sep 18, 2023Updated 2 years ago
- This sample shows how to build vector similarity search on Azure Cosmos DB for PostgreSQL using the pgvector extension and the multi-moda…☆11Jul 13, 2024Updated last year
- DDRel: A new dataset for interpersonal relation classification in dyadic dialogues☆23Sep 12, 2021Updated 4 years ago
- A collection of Zsh functions to augment Git☆19Dec 11, 2025Updated 5 months ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated 11 months ago
- Preparing for ML Interviews.☆53Jan 12, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [WWW 2023] Official code of "Adap-$\tau$: Adaptively Modulating Embedding Magnitude for Recommendation"☆29Jan 4, 2024Updated 2 years ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆90Sep 4, 2025Updated 8 months ago
- ☆19Jan 3, 2025Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆79Dec 8, 2025Updated 5 months ago
- Safe Python Code Execution Environment for Language Models☆17Updated this week
- Metadata browser of TREC☆10May 13, 2026Updated last week
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆101Apr 9, 2025Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- collaborative web tool to enrich content☆12Nov 13, 2011Updated 14 years ago
- Generative Reranker PyTerrier☆18Dec 1, 2025Updated 5 months ago
- ☆12Sep 1, 2023Updated 2 years ago
- A Multi-domain Benchmark for Personalized Search Evaluation☆12Sep 7, 2023Updated 2 years ago
- Use Azure Cognitive Services with React - Face API as an example☆10Mar 3, 2023Updated 3 years ago
- ☆13Apr 23, 2025Updated last year
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆38Oct 1, 2025Updated 7 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆17Dec 19, 2024Updated last year
- Bi-LSTM - CRF Named Entity Recognition model for Korean (Keras)☆16Feb 7, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- Using Azure OpenAI GPT 4o to extract information such as text, tables and charts from Documents to Markdown☆28Jan 26, 2025Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- ☆10Mar 11, 2024Updated 2 years ago
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated 3 months ago
- GenAI Examples☆16Dec 13, 2024Updated last year
- Indexing Sharepoint Online Content to Azure Cognitive Search☆37Nov 15, 2024Updated last year