A complete guide to evaluating LLMs and RAGs. Both theory-based and code-based approaches are covered.
☆28 · Nov 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for Evaluation-of-LLMs-and-RAGs
Users interested in Evaluation-of-LLMs-and-RAGs are comparing it to the libraries listed below.
- Prompt Injection & Prevention techniques. Secure your AI Chatbots built using LLMs. ☆13 · Mar 23, 2024 · Updated 2 years ago
- A code sample that shows how to use 🦜️🔗langchain, 🦙llama_index and a hosted LLM endpoint to do a standard chat or Q&A about a pdf doc… ☆19 · Oct 24, 2023 · Updated 2 years ago
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-design ☆22 · Dec 13, 2024 · Updated last year
- Question Answer Generation App from the documents. Primarily suited to Teachers and related Academia's posts. ☆28 · Aug 11, 2023 · Updated 2 years ago
- ☆11 · Apr 8, 2024 · Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 · Mar 6, 2024 · Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆43 · Feb 15, 2024 · Updated 2 years ago
- Phi-2 Fine Tuning to build a mental health GPT. ☆11 · Jan 6, 2024 · Updated 2 years ago
- DeepScenario: An Open Driving Scenario Dataset for Autonomous Driving System Testing ☆40 · Jan 26, 2024 · Updated 2 years ago
- Playing with RAG using Ollama, Langchain, and Streamlit. This project aims to demonstrate how a recruiter or HR personnel can benefit fro… ☆16 · Jan 21, 2024 · Updated 2 years ago
- ☆16 · Mar 10, 2024 · Updated 2 years ago
- ☆33 · Sep 25, 2024 · Updated last year
- ☆23 · Jan 18, 2024 · Updated 2 years ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆64 · Mar 26, 2024 · Updated 2 years ago
- ☆24 · Jun 12, 2024 · Updated last year
- Phi-2 Colab Notebook ☆14 · Dec 14, 2023 · Updated 2 years ago
- ☆25 · Aug 20, 2024 · Updated last year
- ☆14 · Nov 20, 2015 · Updated 10 years ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU. ☆23 · Oct 14, 2023 · Updated 2 years ago
- ☆23 · Jan 13, 2024 · Updated 2 years ago
- ☆36 · Feb 1, 2024 · Updated 2 years ago
- RAG Python Chat Bot: Gemini, Ollama, Streamlit with LangChain magic! ☆41 · Feb 14, 2024 · Updated 2 years ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. ☆61 · Oct 1, 2024 · Updated last year
- ☆27 · Dec 24, 2024 · Updated last year
- ☆23 · May 14, 2024 · Updated last year
- Multimodal AI App using Llava 7B and Gradio. ☆39 · Apr 30, 2024 · Updated 2 years ago
- ☆11 · Jun 21, 2023 · Updated 2 years ago
- Medical Help App using GPT-4V ☆26 · Jan 1, 2024 · Updated 2 years ago
- Predicting Robinhood stocks using attention ☆11 · Sep 4, 2019 · Updated 6 years ago
- Sample notebooks and prompts for LLM evaluation ☆161 · Nov 2, 2025 · Updated 6 months ago
- The StreamingGradioCallbackHandler is a custom callback handler that works with Language Models (LLMs) that support streaming. It facilit… ☆10 · Oct 21, 2023 · Updated 2 years ago
- ☆21 · Nov 15, 2023 · Updated 2 years ago
- [Nature Medicine] The Limits of Fair Medical Imaging AI In Real-World Generalization ☆29 · Dec 30, 2024 · Updated last year
- Chrome Extension powered by LLM ☆17 · Feb 29, 2024 · Updated 2 years ago
- Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so yo… ☆104 · Nov 24, 2023 · Updated 2 years ago
- LLM Prompt Testing Quick Start ☆79 · Jun 3, 2024 · Updated last year
- A web application to generate multiple choice questions exams for any topic using GPT-3.5 ☆24 · Oct 21, 2024 · Updated last year
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction ☆12 · May 1, 2024 · Updated 2 years ago
- ☆10 · Aug 14, 2020 · Updated 5 years ago