TonicAI / tonic_validateLinks
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
β319Updated 4 months ago
Alternatives and similar repositories for tonic_validate
Users that are interested in tonic_validate are comparing it to the libraries listed below
Sorting:
- π¦π― Flex those feathers!β252Updated last year
- A tool for evaluating LLMsβ425Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β179Updated last year
- β186Updated 2 years ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β442Updated last year
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicatiβ¦β247Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAGβ330Updated last year
- β175Updated last year
- Excel spreadsheet crawler and table parser for data extraction and queryingβ162Updated 8 months ago
- Python SDK for running evaluations on LLM generated responsesβ293Updated 5 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paperβ¦β114Updated last year
- Automated knowledge graph creation SDKβ122Updated 11 months ago
- Fine-Tuning Embedding for RAG with Synthetic Dataβ515Updated 2 years ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ146Updated last year
- Automated Evaluation of RAG Systemsβ667Updated 7 months ago
- A simple Python sandbox for helpful LLM data agentsβ288Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluationβ105Updated 10 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busineβ¦β151Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)β397Updated last year
- β197Updated this week
- Fiddler Auditor is a tool to evaluate language models.β188Updated last year
- β238Updated 5 months ago
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated 2 years ago
- FastAPI wrapper around DSPyβ279Updated last year
- π Datasets and models for instruction-tuningβ237Updated 2 years ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMsβ290Updated 2 weeks ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.β115Updated 3 months ago
- An Awesome list of curated DSPy resources.β469Updated last month
- Sample notebooks and prompts for LLM evaluationβ153Updated last week
- β506Updated last year