zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
β68Updated 3 months ago
Alternatives and similar repositories for strategic-debate-tot:
Users that are interested in strategic-debate-tot are comparing it to the libraries listed below
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ99Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through rβ¦β58Updated 6 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ99Updated 9 months ago
- Official homepage for "Self-Harmonized Chain of Thought"β88Updated last month
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platformβ82Updated last week
- β48Updated last year
- β68Updated 2 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β66Updated 6 months ago
- Just a bunch of benchmark logs for different LLMsβ116Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)β68Updated 3 weeks ago
- Writing Blog Posts with Generative Feedback Loops!β46Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ49Updated 10 months ago
- A framework for orchestrating AI agents using a mermaid graphβ75Updated 8 months ago
- Evaluating LLMs with CommonGen-Liteβ87Updated 9 months ago
- Simple examples using Argilla tools to build AIβ51Updated last month
- Simple Graph Memory for AI applicationsβ81Updated 5 months ago
- Synthetic Data for LLM Fine-Tuningβ107Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.β116Updated 3 months ago
- Red-Teaming Language Models with DSPyβ153Updated 9 months ago
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.β87Updated 5 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.β48Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 6 months ago
- Routing on Random Forest (RoRF)β98Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β79Updated 10 months ago
- Mixing Language Models with Self-Verification and Meta-Verificationβ100Updated last month
- Track the progress of LLM context utilisationβ53Updated 6 months ago
- A framework for evaluating function calls made by LLMsβ36Updated 5 months ago
- β79Updated last week
- β20Updated last year