datacommonsorg / llm-tools

☆62

Alternatives and similar repositories for llm-tools

Users who are interested in llm-tools are comparing it to the libraries listed below.

rhyang2021 / SELFGOAL
Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".
☆68 Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111 Updated 3 months ago
SalesforceAIResearch / SFR-RAG
☆77 Updated 6 months ago
Yannael / multilingual-embeddings
☆64 Updated last year
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53 Updated 8 months ago
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆68 Updated last year
chrisammon3000 / dspy-neo4j-knowledge-graph
LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.
☆187 Updated last year
yale-nlp / SciArena
Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"
☆45 Updated this week
apple / ml-superposition-prompting
☆145 Updated last year
aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆99 Updated this week
SalesforceAIResearch / CRMArena
Official Repo for CRMArena and CRMArena-Pro
☆104 Updated last month
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, LangChain, LlamaIndex, Fructose, Marvin, Outlines, etc. on task…
☆173 Updated 10 months ago
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87 Updated last month
sujitpal / llm-rag-eval
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆31 Updated last year
DunZhang / Stella
☆62 Updated last year
zetaalphavector / RAGElo
RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker
☆114 Updated 3 weeks ago
zhudotexe / redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)
☆83 Updated 4 months ago
miralab-ai / autoreason
☆40 Updated 7 months ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79 Updated 10 months ago
ali-bahrainian / RAG_best_practices
☆93 Updated 4 months ago
rungalileo / hallucination-index
Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆113 Updated last week
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆223 Updated this week
whyhow-ai / whyhow
Automated knowledge graph creation SDK
☆122 Updated 8 months ago
vaughanlove / PromptBreeder
Google DeepMind's PromptBreeder for automated prompt engineering, implemented in LangChain Expression Language.
☆132 Updated last year
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆142 Updated last year
CYQIQ / MultiCoT
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
☆147 Updated last year
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆87 Updated 10 months ago
GoodAI / goodai-ltm-benchmark
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…
☆76 Updated 7 months ago
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86 Updated 6 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91 Updated 6 months ago