garg-ankush / scipe
SCIPE is a powerful tool for evaluating and diagnosing LLM (Large Language Model) graphs or chains.
☆21Updated 5 months ago
Alternatives and similar repositories for scipe:
Users that are interested in scipe are comparing it to the libraries listed below
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆66Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆77Updated last month
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 9 months ago
- ☆77Updated 10 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆77Updated 6 months ago
- Generate Tools and Toolkits from any Python SDK -- no extra code required☆50Updated 5 months ago
- Simple Graph Memory for AI applications☆84Updated 9 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- Dynamic Metadata based RAG Framework☆72Updated 8 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆122Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- ☆31Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆80Updated last year
- A list of AI memory projects☆94Updated 3 months ago
- Leverage your LangChain trace data for fine tuning☆41Updated 8 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆50Updated 6 months ago
- ☆57Updated last year
- ☆16Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- Embed anything.☆29Updated 10 months ago
- Reactive DDD with DSPy☆22Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- An assistant for Slack built with Arcade and Langgraph. Interact with Google Calendar, Mail, Github, Search Engines, Firecrawl and more a…☆73Updated last month
- ☆33Updated 2 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated 2 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- ☆38Updated last week
- Verbosity control for AI agents☆62Updated 11 months ago