nlmatics / nlm-ingestor
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
☆1,154Updated 3 months ago
Alternatives and similar repositories for nlm-ingestor:
Users that are interested in nlm-ingestor are comparing it to the libraries listed below
- Developer APIs to Accelerate LLM Projects☆1,518Updated 2 months ago
- High-performance retrieval engine for unstructured data☆1,110Updated this week
- Improved file parsing for LLM’s☆2,637Updated 2 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆959Updated 3 months ago
- Build and query dynamic, temporally-aware Knowledge Graphs☆1,661Updated last week
- Things you can do with the token embeddings of an LLM☆1,411Updated last week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆2,249Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,433Updated this week
- An LLM-powered advanced RAG pipeline built from scratch☆816Updated 11 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆693Updated 2 months ago
- Open-source tool to visualise your RAG 🔮☆1,097Updated 2 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,172Updated 4 months ago
- RAG that intelligently adapts to your use case, data, and queries☆2,747Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,528Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,238Updated last month
- A system for agentic LLM-powered data processing and ETL☆1,514Updated this week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,310Updated 4 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆1,671Updated this week
- The code used to train and run inference with the ColPali architecture.☆1,386Updated this week
- DOM to Semantic-Markdown for use with LLMs☆710Updated last week
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆1,113Updated this week
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆847Updated last year
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆204Updated this week
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone☆994Updated 2 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆722Updated last month
- 🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and mo…☆967Updated last week
- Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs☆786Updated 7 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆970Updated this week
- Reliable LLM Memory for AI Applications and AI Agents☆1,058Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,294Updated this week