lperezmo / embeddings-extraction
Scripts for reading, extracting, and organizing data from HTML or PDF documents and preparing it for conversion into embeddings for use in context-augmented LLM queries.
☆13Updated 8 months ago
Alternatives and similar repositories for embeddings-extraction
Users that are interested in embeddings-extraction are comparing it to the libraries listed below
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆13Updated last week
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 9 months ago
- Rust bindings for CTranslate2☆14Updated last year
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- FalkorDB-Browser is a visualization UI for FalkorDB.☆30Updated this week
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated 3 weeks ago
- This repository implements DSPy programs for tasks in Indian languages☆13Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆26Updated 2 months ago
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆26Updated last month
- Luann (fka TypeAgent) allows you to create many LLM-based agents (various types of agents, scale up)☆21Updated 2 weeks ago
- Query, ask and chat with a document-index via transformer models!☆17Updated last year
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆10Updated 3 months ago
- ChatBot App built using LangChain and Lightning AI☆18Updated 2 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆50Updated last month
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated 3 weeks ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and perf…☆13Updated last month
- Probably one of the lightest native RAG + Agent apps out there. Experience the power of Agent-powered models and Agent-driven knowledge ba…☆26Updated 3 weeks ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM.☆13Updated 4 months ago
- ☆11Updated 2 years ago
- A minimal implementation of GraphRAG, designed to quickly prototype whether you're able to get good sense-making out of a large dataset w…☆28Updated 3 months ago
- OpenAI compatible API for open source LLMs☆15Updated last year
- Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…☆13Updated 9 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- A narrow implementation of DiagramGPT for generating system architecture diagrams with local LLM models and Llama.cpp☆25Updated 11 months ago
- ☆45Updated 7 months ago
- Automatic Test Generator☆12Updated last month