msoedov / vector_lake
S3 vector database for LLM Agents and RAG.
☆36Updated last year
Alternatives and similar repositories for vector_lake:
Users that are interested in vector_lake are comparing it to the libraries listed below
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆91Updated 5 months ago
- Constrain LLM output☆108Updated 8 months ago
- Repo to experiment with Graph RAG strategies using Kùzu☆49Updated 3 months ago
- DuckDB Community Extension to prompt LLMs from SQL☆43Updated 2 months ago
- Read infrastructure data from your cloud ☁️ and export it to a SQL database 📋.☆33Updated last year
- LLM-Powered Analyses of your GitHub Community using EvaDB☆24Updated last year
- rerank library for easy reranking of results☆37Updated 6 months ago
- ☆14Updated last year
- Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/☆195Updated last year
- Framework for building data agent workflows☆83Updated 7 months ago
- FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs)☆106Updated 2 months ago
- DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles☆48Updated this week
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- A playground for running duckdb as a stateless query engine over a data lake☆190Updated last year
- Postgres extensions to support end-to-end Retrieval-Augmented Generation (RAG) pipelines☆63Updated last month
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆66Updated last month
- Graph Engine for Exploration and Search☆40Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆56Updated 3 months ago
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not…☆39Updated last year
- Lambda function to serverlessly repartition parquet files in S3☆35Updated 4 months ago
- A simple DAG for executing LLM calls and using tools.☆41Updated last year
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆16Updated last week
- portable Python ML-powered data bot☆23Updated 5 months ago
- Code for "Chat with your data using OpenAI, Pinecone, Airbyte and Langchain" tutorial☆36Updated last year
- Private ChatGPT/Perplexity. Securely unlocks knowledge from confidential business information.☆62Updated 5 months ago
- Time series forecasting with DuckDB and Evidence☆39Updated 4 months ago
- Data Encoding and Representation Analysis☆40Updated last year
- ☆22Updated 5 months ago
- Build a RAG dataset for your domain in just a few lines of codes, using your XML sitemap☆45Updated 7 months ago
- Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.☆81Updated 4 months ago