SWORDHealth / mind-eval
AI Research team evaluation repository
☆24 Updated 2 weeks ago
Alternatives and similar repositories for mind-eval
Users who are interested in mind-eval are comparing it to the repositories listed below.
- This is a framework that implements various parallel reasoning strategies from the literature ☆273 Updated this week
- A Python toolkit for chain-of-thought prompting ☆180 Updated last week
- Pixelagent: Multimodal stateful agents ☆223 Updated 6 months ago
- Agent accuracy measurements for LLMs ☆204 Updated last year
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per-token costs. Use our tool for efficient … ☆222 Updated last year
- Action library for AI Agent ☆230 Updated 8 months ago
- ☆86 Updated 2 months ago
- See Through Your Models ☆401 Updated 5 months ago
- Dead Simple LLM Abliteration ☆244 Updated 10 months ago
- Structured Output Is All You Need! ☆59 Updated last year
- State-of-the-art browsing agent (WebArena 72.7%) ☆360 Updated 2 months ago
- Chat strategies for LLMs ☆125 Updated this week
- Physical AI Assistant that illuminates your life ☆189 Updated 2 months ago
- ☆291 Updated 6 months ago
- Applying the ideas of Deepseek R1 to computer use ☆217 Updated 10 months ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat… ☆446 Updated 3 weeks ago
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar… ☆227 Updated 4 months ago
- Fast Diversification for Search & Retrieval ☆454 Updated last month
- Build Secure and Compliant AI agents and MCP Servers. YC W23 ☆153 Updated 6 months ago
- ☆298 Updated 8 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆238 Updated 2 years ago
- Fully neural approach for text chunking ☆403 Updated last month
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆281 Updated 2 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig… ☆225 Updated 11 months ago
- Browser-LLM Auto-Scaling Technology ☆767 Updated 2 weeks ago
- Examples and guides for using the VLM Run API ☆300 Updated this week
- An AI-generated book exploring how artificial intelligence development reveals hidden patterns in human cognition and communication ☆77 Updated 3 weeks ago
- Praetor is a lightweight finetuning data and prompt management tool ☆67 Updated last year
- AI for jq ☆248 Updated last year
- ☆164 Updated 8 months ago