dair-ai / llm-evaluator
Example for Logging LLM Evaluator Prompt Responses
☆15 Updated last year
Alternatives and similar repositories for llm-evaluator:
Users who are interested in llm-evaluator are comparing it to the libraries listed below.
- ☆11 Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- BH hackathon ☆14 Updated last year
- Using multiple LLMs for ensemble Forecasting ☆16 Updated last year
- ChatBot App built using LangChain and Lightning AI ☆18 Updated 2 years ago
- ☆20 Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆13 Updated last week
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 Updated last year
- ☆13 Updated 7 months ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc.) ☆20 Updated last year
- Tool to take your ML model from local to production with one line of code. ☆25 Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆23 Updated this week
- ☆12 Updated 11 months ago
- A Python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 Updated 8 months ago
- A simple create-llama template using llama-index v0.10 and integrated with Ollama ☆10 Updated 11 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 7 months ago
- Unstract's interface to LLMs, Embeddings and VectorDBs. ☆18 Updated 9 months ago
- ☆29 Updated last year
- ☆16 Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 5 months ago
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search. ☆13 Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆39 Updated last year
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆21 Updated last month
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 5 months ago
- Synthetic text dataset generation ☆9 Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year
- Examples and guides to using Nomic Atlas ☆32 Updated last week
- ☆17 Updated this week