interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated 7 months ago
Alternatives and similar repositories for function-calling-eval:
Users that are interested in function-calling-eval are comparing it to the libraries listed below
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ☆65Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 11 months ago
- ☆30Updated 8 months ago
- ☆48Updated last year
- ☆76Updated 9 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 8 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 5 months ago
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- LLM reads a paper and produce a working prototype☆50Updated this week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆73Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 6 months ago
- Simple Graph Memory for AI applications☆84Updated 7 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 7 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆49Updated 5 months ago
- Synthetic Data for LLM Fine-Tuning☆112Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆83Updated this week
- ☆47Updated 11 months ago
- ☆111Updated 3 months ago
- ☆20Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆104Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆58Updated last year
- ☆38Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- ☆75Updated last year