A complete guide to evaluate LLMs and RAGs. Both theory and code based approaches covered.
☆28Nov 16, 2023Updated 2 years ago
Alternatives and similar repositories for Evaluation-of-LLMs-and-RAGs
Users that are interested in Evaluation-of-LLMs-and-RAGs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Feb 15, 2024Updated 2 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-design☆22Dec 13, 2024Updated last year
- Question Answer Generation App from the documents. Primarily suited to Teachers and related Academia's posts.☆28Aug 11, 2023Updated 2 years ago
- ☆11Apr 8, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Sep 8, 2024Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Mar 6, 2024Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago
- TorchEsegeta: Interpretability and Explainability pipeline for PyTorch☆20Feb 19, 2024Updated 2 years ago
- Phi-2 Fine Tuning to build a mental health GPT.☆11Jan 6, 2024Updated 2 years ago
- This is the official repo of "Quick Minutes of Meeting using ChatGPT" video on AI Anytime YouTube channel. We have used Da Vinci 003 mode…☆15Sep 27, 2023Updated 2 years ago
- DeepScenario: An Open Driving Scenario Dataset for Autonomous Driving System Testing☆39Jan 26, 2024Updated 2 years ago
- Playing with RAG using Ollama, Langchain, and Streamlit. This project aims to demonstrate how a recruiter or HR personnel can benefit fro…☆16Jan 21, 2024Updated 2 years ago
- ☆16Mar 10, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆24Jun 12, 2024Updated last year
- Phi-2 Colab Notebook☆14Dec 14, 2023Updated 2 years ago
- This is a QA bot based on Code Llama. It is a new LLM for code by Meta AI.☆13Aug 25, 2023Updated 2 years ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆22Feb 8, 2025Updated last year
- ☆25Aug 20, 2024Updated last year
- ☆14Nov 20, 2015Updated 10 years ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.☆23Oct 14, 2023Updated 2 years ago
- ☆23Jan 13, 2024Updated 2 years ago
- ☆36Feb 1, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 🚀 RAG Python Chat Bot: Gemini, Ollama, Streamlit with LangChain magic! 🤖💬☆39Feb 14, 2024Updated 2 years ago
- Groq Chat App built using Groq API and Streamlit.☆31Mar 15, 2024Updated 2 years ago
- ☆27Dec 24, 2024Updated last year
- ☆23May 14, 2024Updated last year
- Multimodal AI App using Llava 7B and Gradio.☆39Apr 30, 2024Updated last year
- ☆74Apr 24, 2024Updated last year
- ☆11Jun 21, 2023Updated 2 years ago
- Predicting Robinhood stocks using attention☆11Sep 4, 2019Updated 6 years ago
- Sample notebooks and prompts for LLM evaluation☆160Nov 2, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The StreamingGradioCallbackHandler is a custom callback handler that works with Language Models (LLMs) that support streaming. It facilit…☆10Oct 21, 2023Updated 2 years ago
- ☆21Nov 15, 2023Updated 2 years ago
- A Python-based chatbot project built on the autogen and tinygrad foundation, utilizing advanced agents for dynamic conversations and func…☆26Oct 9, 2024Updated last year
- Evaluating LLMs with CommonGen-Lite☆95Mar 21, 2024Updated 2 years ago
- Chrome Extension powered by LLM☆17Feb 29, 2024Updated 2 years ago
- Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so yo…☆104Nov 24, 2023Updated 2 years ago
- LLM Prompt Testing Quick Start☆79Jun 3, 2024Updated last year