Dicklesworthstone / swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
☆920 · Updated this week
Related projects:
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆821 · Updated 8 months ago
- An LLM-powered advanced RAG pipeline built from scratch ☆785 · Updated 7 months ago
- High-performance retrieval engine for unstructured data ☆778 · Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,031 · Updated 3 weeks ago
- Curate better data for LLMs ☆934 · Updated 6 months ago
- The Open Source Memory Layer For Autonomous Agents ☆1,390 · Updated last week
- 🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and mo… ☆871 · Updated this week
- Build robust LLM applications with true composability 🔗 ☆410 · Updated 8 months ago
- LLMFlows - Simple, Explicit and Transparent LLM Apps ☆659 · Updated 5 months ago
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone ☆948 · Updated 3 weeks ago
- Build agents which are controlled by LLMs ☆925 · Updated 6 months ago
- Build and query dynamic, temporally-aware Knowledge Graphs ☆572 · Updated this week
- Agents Capable of Self-Editing Their Prompts / Python Code ☆732 · Updated 6 months ago
- Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory. ☆1,925 · Updated this week
- Deterministic LLM Outputs for AI Applications and AI Agents ☆807 · Updated last week
- Open-source tool to visualise your RAG 🔮 ☆1,059 · Updated 6 months ago
- LLM(😽) ☆1,602 · Updated this week
- Structured and typehinted GPT responses in Python ☆734 · Updated last month
- Chat language model that can use tools and interpret the results ☆1,358 · Updated this week
- ☆719 · Updated 5 months ago
- A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-s… ☆1,022 · Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆2,817 · Updated 2 weeks ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆1,097 · Updated this week
- Ship RAG based LLM web apps in seconds. ☆969 · Updated 7 months ago
- Things you can do with the token embeddings of an LLM ☆730 · Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆790 · Updated last week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆467 · Updated last month
- ☆478 · Updated 3 weeks ago
- Exact structure out of any language model completion. ☆497 · Updated last year
- Seamlessly integrate LLMs as Python functions ☆1,940 · Updated this week