interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆36Updated 5 months ago
Alternatives and similar repositories for function-calling-eval:
Users that are interested in function-calling-eval are comparing it to the libraries listed below
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 2 months ago
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆48Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought"☆88Updated last month
- Writing Blog Posts with Generative Feedback Loops!☆46Updated 9 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆58Updated 6 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Just a bunch of benchmark logs for different LLMs☆116Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 9 months ago
- ☆65Updated 7 months ago
- ☆107Updated 3 weeks ago
- ☆68Updated 2 months ago
- ☆48Updated last year
- ☆18Updated 3 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- ☆30Updated 6 months ago
- auto fine tune of models with synthetic data☆74Updated 11 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆20Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 4 months ago
- Simple examples using Argilla tools to build AI☆51Updated last month
- ☆75Updated 11 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- entropix style sampling + GUI☆25Updated 2 months ago
- Tools to make language models a bit easier to use☆32Updated last month
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆53Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆68Updated 3 weeks ago
- ☆20Updated last year
- Simple Graph Memory for AI applications☆81Updated 5 months ago