Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so your GenAI-powered solution has predictable and reliable performance.
☆104Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for deepmark
Users that are interested in deepmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analyze your image in seconds with AI☆63May 28, 2024Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆866Jan 15, 2024Updated 2 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Automate UI testing + functionality testing with GPT-4 Vision☆45Dec 17, 2023Updated 2 years ago
- A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualizatio…☆109Dec 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Explore GitHub repositories with natural language questions☆98Dec 22, 2024Updated last year
- AI Research Agent - Search, Scrape, Summarize, Analysis☆15Nov 11, 2023Updated 2 years ago
- ⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metada…☆229Nov 21, 2023Updated 2 years ago
- This project is designed to make it more efficient to collate data from a Youtube Channel to create custom GPTs, train models or for use …☆14Dec 2, 2023Updated 2 years ago
- AI Developer is an AI agent powered by GPT-4-Turbo that's using custom E2B Sandbox☆55Feb 11, 2025Updated last year
- Documentation for the Krixik Python client.☆38Nov 8, 2024Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆64Mar 26, 2024Updated 2 years ago
- ☆11Aug 28, 2023Updated 2 years ago
- A complete guide to evaluate LLMs and RAGs. Both theory and code based approaches covered.☆28Nov 16, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A web-app to explore topics using LLM (less typing and more clicks)☆67Mar 15, 2026Updated 2 months ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Nov 16, 2024Updated last year
- A fast and minimal framework for building agentic systems☆484May 8, 2026Updated last week
- Simplistic and minimalist storage.☆24May 11, 2025Updated last year
- LLM Chain querying a scientific Zotero library, with citations☆441Aug 4, 2023Updated 2 years ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Oct 1, 2024Updated last year
- Load generator for TCP servers.☆20Mar 28, 2024Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago
- ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli…☆178Nov 8, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17May 3, 2024Updated 2 years ago
- HeroML is an AI Prompt Chain/Workflow interpreter for Apps built on https://hero.page☆55Aug 10, 2023Updated 2 years ago
- Ipython notebook copy of Andrej Karpathy's llama2.c☆23Sep 5, 2023Updated 2 years ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Dec 20, 2024Updated last year
- Large language model evaluation and workflow framework from Phase AI.☆458Jan 21, 2025Updated last year
- POC of a phone used as SMS gateway to serve queries to chatGPT over GSM network using the regular Android message app.☆19Jul 18, 2023Updated 2 years ago
- Clint LLM GitHub Pages☆83Sep 12, 2024Updated last year
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their…☆35Mar 25, 2026Updated last month
- Python SDK for running evaluations on LLM generated responses☆300Jun 6, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is a tool that uses GPT4 Vision to operate your computer☆30Dec 19, 2023Updated 2 years ago
- With just Llama-2, generate full React codebases from a single prompt☆103Aug 25, 2023Updated 2 years ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆451Nov 24, 2025Updated 5 months ago
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆495Nov 28, 2023Updated 2 years ago
- Wraps openai.ChatCompletion to produce pydantic model output via schema prompt and error feedback.☆54Jun 4, 2023Updated 2 years ago
- Evaluating LLMs with CommonGen-Lite☆95Mar 21, 2024Updated 2 years ago