Querying local documents, powered by LLM
☆658Jan 17, 2026Updated 4 months ago
Alternatives and similar repositories for llm-search
Users that are interested in llm-search are comparing it to the libraries listed below.
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Generic rag framework to apply the power of LLMs on any given dataset☆677Feb 24, 2026Updated 2 months ago
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago
- Ask a directory of files questions. Powered by ChromaDB and ChatGPT☆14Aug 15, 2023Updated 2 years ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,577May 12, 2026Updated last week
- Harness LLMs with Multi-Agent Programming☆4,015May 6, 2026Updated 2 weeks ago
- An LLM-powered advanced RAG pipeline built from scratch☆857Jan 26, 2024Updated 2 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,919May 17, 2025Updated last year
- WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.☆1,592Jan 31, 2026Updated 3 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,052Feb 27, 2025Updated last year
- ⚡ Local chat assistants with AI superpowers☆336Feb 13, 2026Updated 3 months ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,701May 11, 2026Updated last week
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- Ship RAG based LLM web apps in seconds.☆1,004Jan 29, 2024Updated 2 years ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆13Sep 19, 2024Updated last year
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆60May 17, 2023Updated 3 years ago
- Using LlamaIndex, Redis, and OpenAI to chat with PDF documents. Supplementary material for blog post on Microsoft Developer Blog☆113Nov 9, 2023Updated 2 years ago
- LlamaIndex is the leading document agent and OCR platform☆49,501Updated this week
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Sep 5, 2023Updated 2 years ago
- Self-evaluating interview for AI coders☆599Jun 21, 2025Updated 10 months ago
- High accuracy RAG for answering questions from scientific documents with citations☆8,496Mar 20, 2026Updated 2 months ago
- structured outputs for llms☆12,974Updated this week
- Using Langroid's Multi-Agent Framework to Build LLM Apps☆152Jun 29, 2025Updated 10 months ago
- Supercharge Your LLM Application Evaluations 🚀☆13,896Feb 24, 2026Updated 2 months ago
- Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. D…☆11,988Oct 9, 2025Updated 7 months ago
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 7 months ago
- A multimodal, function calling powered LLM webui.☆213Sep 23, 2024Updated last year
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- Go ahead and axolotl questions☆11,938Updated this week
- Efficient Retrieval Augmentation and Generation Framework☆1,779Jan 12, 2026Updated 4 months ago
- High-performance retrieval engine for unstructured data☆1,583Nov 10, 2025Updated 6 months ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed not…☆27,360Updated this week
- An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during run…☆95Apr 8, 2026Updated last month
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,700May 13, 2026Updated last week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- Large-scale LLM inference engine☆1,727May 8, 2026Updated last week
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆121Jan 28, 2024Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year