interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆36Updated 6 months ago
Alternatives and similar repositories for function-calling-eval:
Users that are interested in function-calling-eval are comparing it to the libraries listed below
- Routing on Random Forest (RoRF)☆112Updated 4 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆48Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆58Updated 6 months ago
- ☆48Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 2 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆70Updated 2 weeks ago
- Chat Markup Language conversation library☆55Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆100Updated 10 months ago
- ☆30Updated 7 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- ☆76Updated 8 months ago
- Convert a web page to markdown☆63Updated 5 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆101Updated last year
- ☆65Updated 8 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 6 months ago
- A framework for orchestrating AI agents using a mermaid graph☆74Updated 9 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆70Updated 4 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆59Updated 3 months ago
- ☆38Updated last year
- ☆46Updated 10 months ago
- RAG example using DSPy, Gradio, FastAPI☆74Updated 10 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆21Updated 11 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- ☆111Updated last month
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 9 months ago