datacommonsorg / llm-toolsLinks
☆62Updated 6 months ago
Alternatives and similar repositories for llm-tools
Users that are interested in llm-tools are comparing it to the libraries listed below
Sorting:
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆68Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 3 months ago
- ☆77Updated 6 months ago
- ☆64Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 8 months ago
- Evaluation of bm42 sparse indexing algorithm☆68Updated last year
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆187Updated last year
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆45Updated this week
- ☆145Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆99Updated this week
- Official Repo for CRMArena and CRMArena-Pro☆104Updated last month
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 10 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated last month
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆31Updated last year
- ☆62Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆114Updated 3 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆83Updated 4 months ago
- ☆40Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 10 months ago
- ☆93Updated 4 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆113Updated last week
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆223Updated this week
- Automated knowledge graph creation SDK☆122Updated 8 months ago
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆132Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆142Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆147Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆87Updated 10 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆76Updated 7 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆86Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 6 months ago