philschmid / evaluate-llms
Includes examples on how to evaluate LLMs
β23Updated 5 months ago
Alternatives and similar repositories for evaluate-llms:
Users that are interested in evaluate-llms are comparing it to the libraries listed below
- Mistral + Haystack: build RAG pipelines that rock π€β103Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- GenAI Experimentationβ58Updated 2 months ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Llβ¦β16Updated 11 months ago
- β143Updated 8 months ago
- Scripts, notebooks, and articles about data science in general.β47Updated last year
- Optimized Large Language Models for Financial Applications β Efficient, Scalable, and Domain-Specific AI for Finance.β46Updated 2 weeks ago
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 10 months ago
- Running load tests on a FastAPI application using Locustβ15Updated 3 weeks ago
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!β67Updated last week
- β15Updated last year
- β20Updated last year
- Dynamic Metadata based RAG Frameworkβ72Updated 8 months ago
- β88Updated last year
- Writing Blog Posts with Generative Feedback Loops!β47Updated last year
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with Lβ¦β81Updated 11 months ago
- Codebase accompanying the Summary of a Haystack paper.β77Updated 6 months ago
- β77Updated 10 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β46Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paperβ¦β101Updated last year
- β45Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β107Updated last week
- β29Updated last year
- The LangChain Crash Course Repository is a concise and comprehensive collection of learning materials for the LangChain programming languβ¦β21Updated last year
- A reasoning assistant for your STEM educationβ19Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ102Updated 2 weeks ago
- Explore the use of DSPy for extracting features from PDFs πβ39Updated last year
- Sample notebooks and prompts for LLM evaluationβ124Updated 4 months ago
- Controllable-RAG-Agent using Langgraphβ17Updated 8 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)β11Updated last year