interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆34 · Updated last month
Related projects:
- Simple Graph Memory for AI applications ☆76 · Updated last month
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆53 · Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs ☆112 · Updated last month
- Writing Blog Posts with Generative Feedback Loops! ☆41 · Updated 6 months ago
- ☆48 · Updated 11 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆49 · Updated 3 weeks ago
- auto fine tune of models with synthetic data ☆71 · Updated 7 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆93 · Updated 5 months ago
- Track the progress of LLM context utilisation ☆53 · Updated 2 months ago
- ☆32 · Updated 2 weeks ago
- ☆64 · Updated 3 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆57 · Updated 4 months ago
- ☆75 · Updated 3 weeks ago
- ☆38 · Updated this week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆80 · Updated 3 weeks ago
- ☆37 · Updated 9 months ago
- ☆46 · Updated 5 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil… ☆26 · Updated 9 months ago
- ☆30 · Updated 6 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 · Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" ☆45 · Updated this week
- Using modal.com to process FineWeb-edu data ☆18 · Updated last week
- A framework for orchestrating AI agents using a mermaid graph ☆74 · Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆117 · Updated 8 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. ☆48 · Updated 3 weeks ago
- ☆57 · Updated last year
- ☆101 · Updated 5 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆99 · Updated 8 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆67 · Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆73 · Updated 6 months ago