Talc-AI / search-benchLinks
☆19Updated 11 months ago
Alternatives and similar repositories for search-bench
Users that are interested in search-bench are comparing it to the libraries listed below
Sorting:
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated 11 months ago
- ☆160Updated this week
- ☆93Updated last year
- Prompt leak technique for Bing Chat☆34Updated last year
- Globot is an agent that controls your browser using playwright and GPT-4V.☆134Updated last year
- A very simple cross-service LLM API for Python☆21Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆99Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆114Updated last month
- Globe Engineer - Handkerchief: A higher quality alternative to vector database RAG.☆25Updated last year
- Data about 349K OpenAI Custom GPTs☆145Updated last year
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge…☆166Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆152Updated 2 weeks ago
- Test suite for LLM prompts☆52Updated last year
- Python SDK for running evaluations on LLM generated responses☆291Updated 2 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆78Updated 6 months ago
- ☆161Updated 3 weeks ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆499Updated last year
- Prompt engineering, automated.☆340Updated 4 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆302Updated 2 months ago
- Together Open Deep Research☆342Updated 4 months ago
- Annoucing Instructor Cloud☆37Updated last year
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆209Updated 6 months ago
- Hosted embedding platform to discover, evaluate, and retrieve embeddings☆73Updated last year
- ☆172Updated last year
- Python client library for improving your LLM app accuracy☆98Updated 6 months ago
- ☆52Updated 4 months ago
- A system that tries to resolve all issues on a github repo with OpenHands.☆112Updated 9 months ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆306Updated 8 months ago
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆98Updated last year
- ☆413Updated last year