datacommonsorg / llm-toolsLinks
β72Updated 2 months ago
Alternatives and similar repositories for llm-tools
Users that are interested in llm-tools are comparing it to the libraries listed below
Sorting:
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ114Updated 9 months ago
- π§ Compare how Agent systems perform on several benchmarks. ππβ103Updated 5 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β69Updated last year
- DSPY on action with OpenSource LLMs.β103Updated last year
- Simple examples using Argilla tools to build AIβ57Updated last year
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)β90Updated last month
- β125Updated 11 months ago
- β82Updated 2 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β83Updated last year
- β68Updated last year
- Official Repo for CRMArena and CRMArena-Proβ133Updated 2 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platformβ90Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)β92Updated last year
- β66Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"β60Updated 11 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ126Updated 3 months ago
- β147Updated last year
- β92Updated 2 years ago
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.β163Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ72Updated last year
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.β199Updated last year
- ππ§ Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!β53Updated 6 months ago
- A Lightweight Library for AI Observabilityβ255Updated 11 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)β82Updated 11 months ago
- Evaluation of bm42 sparse indexing algorithmβ72Updated last year
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.β140Updated 5 months ago
- Voyage AI Official Python Libraryβ91Updated this week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ148Updated last year
- Repository for βPlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makersβ, NAACL24β152Updated last year