nlmatics / nlm-ingestor
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
☆1,228Updated last month
Alternatives and similar repositories for nlm-ingestor:
Users that are interested in nlm-ingestor are comparing it to the libraries listed below
- Developer APIs to Accelerate LLM Projects☆1,642Updated 6 months ago
- High-performance retrieval engine for unstructured data☆1,364Updated this week
- Improved file parsing for LLM’s☆2,936Updated 5 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆788Updated 5 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,014Updated 2 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆854Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,426Updated 2 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,206Updated last month
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆776Updated 3 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,400Updated last month
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,587Updated 2 weeks ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,569Updated last week
- Open-source tool to visualise your RAG 🔮☆1,126Updated 4 months ago
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF☆895Updated last week
- Things you can do with the token embeddings of an LLM☆1,440Updated last month
- LLM(😽)☆1,667Updated 3 months ago
- An LLM-powered advanced RAG pipeline built from scratch☆835Updated last year
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆743Updated this week
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,200Updated 8 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆1,787Updated this week
- A Repo For Document AI☆2,810Updated 3 weeks ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,647Updated 3 weeks ago
- OCR Benchmark☆470Updated 2 weeks ago
- Data-Driven Evaluation for LLM-Powered Applications☆489Updated 3 months ago
- ☆871Updated 6 months ago
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone☆1,014Updated 5 months ago
- DOM to Semantic-Markdown for use with LLMs☆822Updated 3 months ago
- Seamlessly integrate LLMs as Python functions☆2,283Updated last week
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆421Updated last year
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆299Updated last month