browser-use / eval
☆42Updated 10 months ago
Alternatives and similar repositories for eval
Users who are interested in eval are comparing it to the libraries listed below.
- ☆33Updated 2 years ago
- Voyage AI Official Python Library☆81Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆113Updated 7 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆78Updated last year
- Simple examples using Argilla tools to build AI☆56Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated last month
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆116Updated 4 months ago
- Create embeddings with infinity as serverless endpoint☆41Updated 6 months ago
- proof-of-concept of Cursor's Instant Apply feature☆85Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆88Updated this week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆42Updated last year
- Training setup for Langchain's Open Deep Research☆71Updated 2 months ago
- A list of AI memory projects☆247Updated 10 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆92Updated last month
- DSPy in action with open-source LLMs.☆98Updated last year
- A function to do all☆35Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆71Updated 2 weeks ago
- Routing on Random Forest (RoRF)☆219Updated last year
- Convert a web page to markdown☆80Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 10 months ago
- Verbosity control for AI agents☆64Updated last year
- A toolkit for building computer use AI agents☆178Updated 4 months ago
- Query Expansion for Better Query Embedding using LLMs☆62Updated 9 months ago
- Embedding models from Jina AI☆65Updated last year
- Agent that routes to different tools - LLM classifier SDK☆44Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆57Updated 8 months ago
- Official Repo for CRMArena and CRMArena-Pro☆125Updated 2 weeks ago
- A collection of Compound Retrieval Systems implemented with DSPy and Weaviate.☆91Updated last month
- Using modal.com to process FineWeb-edu data☆20Updated 7 months ago