browser-use / eval
☆42 Updated 9 months ago
Alternatives and similar repositories for eval
Users that are interested in eval are comparing it to the libraries listed below
- proof-of-concept of Cursor's Instant Apply feature ☆83 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆88 Updated last month
- ☆33 Updated 2 years ago
- Voyage AI Official Python Library ☆80 Updated last month
- Simple Graph Memory for AI applications ☆89 Updated 5 months ago
- Natural Language Interfaces Powered by LLMs ☆92 Updated last year
- ☆95 Updated last year
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex. ☆49 Updated last year
- Aider's refactoring benchmark exercises based on popular python repos ☆77 Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆41 Updated last year
- A list of AI memory projects ☆234 Updated 9 months ago
- A daemon that makes a desktop OS accessible to AI agents ☆33 Updated 4 months ago
- ☆40 Updated 5 months ago
- Anthropic Computer Use with Modal Sandboxes ☆40 Updated 11 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆69 Updated last year
- Code interpreter support for o1 ☆32 Updated last year
- A Python package to dynamically load functions for OpenAI Assistant ☆54 Updated last year
- ☆16 Updated 9 months ago
- The official repository for the Anything But Wrappers: Llama Edition Hackameetup ☆22 Updated 2 years ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆89 Updated 2 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 8 months ago
- Crawl and convert any website into clean markdown ☆61 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆111 Updated 6 months ago
- ☆114 Updated last month
- A toolkit for building computer use AI agents ☆176 Updated 3 months ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ☆34 Updated 6 months ago
- Query Expansion for Better Query Embedding using LLMs ☆58 Updated 8 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆75 Updated last year
- Embedding models from Jina AI ☆65 Updated last year
- Python SDK for Browserbase ☆65 Updated last week