davanstrien / data-for-fine-tuning-llms
☆77 Updated 10 months ago
Alternatives and similar repositories for data-for-fine-tuning-llms:
Users who are interested in data-for-fine-tuning-llms are comparing it to the repositories listed below.
- ☆78 Updated 10 months ago
- Reference-Free automatic summarization evaluation with potential hallucination detection ☆100 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- Simple examples using Argilla tools to build AI ☆52 Updated 4 months ago
- ☆51 Updated 10 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 Updated last year
- ☆66 Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- ☆19 Updated 5 months ago
- ☆29 Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆50 Updated 6 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆101 Updated last week
- ☆40 Updated 2 months ago
- Tools to make language models a bit easier to use ☆41 Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- Leverage your LangChain trace data for fine tuning ☆41 Updated 8 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 7 months ago
- A framework for evaluating function calls made by LLMs ☆37 Updated 8 months ago
- Dynamic Metadata based RAG Framework ☆72 Updated 8 months ago
- ☆12 Updated 11 months ago
- Verbosity control for AI agents ☆61 Updated 10 months ago
- Knowledge Graph Generator app ☆30 Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 Updated this week
- ☆24 Updated last year
- ☆88 Updated last year
- RAG example using DSPy, Gradio, FastAPI ☆76 Updated last year
- ☆45 Updated 11 months ago
- Chunk your text using gpt4o-mini more accurately ☆44 Updated 8 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker. ☆48 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆66 Updated 5 months ago