zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆63Updated last month
Related projects ⓘ
Alternatives and complementary repositories for strategic-debate-tot
- Official homepage for "Self-Harmonized Chain of Thought"☆83Updated 2 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆57Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆64Updated this week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 10 months ago
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated this week
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆47Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- Simple examples using Argilla tools to build AI☆40Updated this week
- A framework for evaluating function calls made by LLMs☆35Updated 3 months ago
- ☆37Updated 3 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆55Updated 3 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆65Updated 4 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆110Updated last month
- Simple Graph Memory for AI applications☆79Updated 3 months ago
- Red-Teaming Language Models with DSPy☆142Updated 7 months ago
- ☆48Updated last year
- Track the progress of LLM context utilisation☆53Updated 4 months ago
- ☆75Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆97Updated 7 months ago
- Routing on Random Forest (RoRF)☆84Updated last month
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- A framework for orchestrating AI agents using a mermaid graph☆74Updated 6 months ago
- Automating enterprise workflows with multimodal agents☆94Updated last month
- Evaluating LLMs with CommonGen-Lite☆85Updated 8 months ago
- Claude API Test Project☆87Updated 6 months ago
- EcoAssistant: using LLM assistant more affordably and accurately☆129Updated 4 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆47Updated last month
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated 11 months ago