Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with BM25 for accurate document retrieval. It parses PDFs, chunks content contextually, and enhances search precision with AI-powered contextual understanding and re-ranking.
☆49 · Oct 13, 2024 · Updated last year
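The BM25 stage mentioned in the description can be illustrated with a minimal, standard-library sketch. This is not the repository's actual code: the function names (`tokenize`, `bm25_scores`, `rank`), the whitespace tokenizer, and the parameter defaults `k1=1.5` and `b=0.75` are all assumptions chosen for illustration. The GPT-4o query expansion and Cohere re-ranking steps are not shown; in such a pipeline they would typically run before and after this lexical scoring step, respectively.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one Okapi BM25 score per document for the given query."""
    if not docs:
        return []
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Average document length; fall back to 1.0 if every document is empty.
    avg_len = sum(len(t) for t in tokenized) / n_docs or 1.0

    # Document frequency: number of documents containing each term.
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))

    query_terms = tokenize(query)
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Smoothed IDF variant that stays non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def rank(query, docs, top_k=3):
    """Return (index, score) pairs for the top_k highest-scoring documents."""
    scores = bm25_scores(query, docs)
    order = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    return [(i, scores[i]) for i in order[:top_k]]
```

In a retrieve-then-rerank setup, the candidates returned by `rank` would be passed to a re-ranking model, which reorders them using the full query and chunk text rather than term overlap alone.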
Alternatives and similar repositories for contextual-doc-retrieval-opneai-reranker
Users that are interested in contextual-doc-retrieval-opneai-reranker are comparing it to the libraries listed below.
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs… ☆49 · Oct 2, 2024 · Updated last year
- This experimental tool leverages Google's Gemini 2.5 Flash Preview model to parse complex tables from PDF documents and convert them into… ☆15 · May 16, 2025 · Updated last year
- A Retrieval-Augmented Generation (RAG) system running the DeepSeek R1 Distill Llama 70B model using Groq's fast inference API. ☆13 · Jan 29, 2025 · Updated last year
- Contextual Retrieval solves this problem by prepending chunk-specific explanatory context to each chunk before embedding (“Contextual Emb… ☆28 · Sep 29, 2024 · Updated last year
- Automate web interactions using LLMs and Llama-Index AgentWorkflow ☆16 · Jan 25, 2025 · Updated last year
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o… ☆66 · Dec 19, 2024 · Updated last year
- ☆30 · Apr 23, 2025 · Updated last year
- ☆30 · Oct 4, 2024 · Updated last year
- RFP-Response Analyzer is a Flask web app that uses AI to analyze and compare RFP documents with their responses, providing insights, gap … ☆33 · Jan 2, 2025 · Updated last year
- Yaraa (Yet Another Rag Automation Attempt) is a library that tackles the boring aspects of managing Rag pipelines, so you don't have to. ☆26 · Sep 5, 2024 · Updated last year
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio… ☆86 · Oct 2, 2024 · Updated last year
- A showcase of Phi Agent capabilities featuring a Streamlit-powered Research and Finance AI assistant built with Meta Llama 3.3 70B Instru… ☆13 · Dec 7, 2024 · Updated last year
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi… ☆26 · Jun 7, 2025 · Updated 11 months ago
- Deep Research through Multi-Agents, using GraphRAG ☆86 · Aug 21, 2025 · Updated 8 months ago
- This is based on the paper Chain-of-Retrieval Augmented Generation ☆15 · Mar 29, 2025 · Updated last year
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI… ☆52 · Dec 30, 2024 · Updated last year
- Document Q&A using RAG - GEMINI PRO ☆42 · Feb 13, 2024 · Updated 2 years ago
- ☆12 · Dec 11, 2022 · Updated 3 years ago
- ☆10 · Oct 19, 2023 · Updated 2 years ago
- LangChain DeepResearch: Autonomous recursive research powered by any LLM ☆19 · Mar 19, 2025 · Updated last year
- Repository for our "RAG in Practice (2025)" event! ☆17 · Mar 26, 2025 · Updated last year
- Bringing semantic search to Django. Integrates seamlessly with Django ORM. ☆48 · Oct 1, 2025 · Updated 7 months ago
- Qualitative Data Analysis done by AI (or LLMs). 🖥️ Streamlit & 🔗 Langchain ☆49 · Jan 20, 2025 · Updated last year
- Accelerates vector generation using an ONNX model ☆18 · Jan 23, 2024 · Updated 2 years ago
- Enable real file upload for ChatGPT (o1, o1-pro, etc ...) ☆29 · Apr 12, 2025 · Updated last year
- An End-to-End ETL data pipeline that leverages PySpark parallel processing to process about 25 million rows of data coming from a SaaS ap… ☆25 · Dec 7, 2022 · Updated 3 years ago
- Recursive Self-Aggregation evals on ARC-AGI ☆36 · Jan 26, 2026 · Updated 3 months ago
- Let ChatGPT find you proactively ☆12 · Apr 15, 2023 · Updated 3 years ago
- Sample retrieval-augmented generation (RAG) app template using Azure AI Search, Azure OpenAI, and Vercel AI SDK ☆46 · Aug 14, 2025 · Updated 9 months ago
- ☆11 · Feb 1, 2021 · Updated 5 years ago
- A simple Model Context Protocol (MCP) server for generating memes using the ImgFlip API ☆49 · Mar 12, 2025 · Updated last year
- ☆21 · Mar 3, 2026 · Updated 2 months ago
- ☆11 · May 2, 2022 · Updated 4 years ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 8 months ago
- Python code which creates a semantic search bot over any available corpus ☆17 · May 22, 2023 · Updated 2 years ago
- Extract information from Glimmer components to generate documentation using typescript parser/checker ☆14 · Mar 19, 2024 · Updated 2 years ago
- A new business pipeline architecture for Ruby/Rails applications ☆14 · Aug 2, 2024 · Updated last year
- Implementation of RFC 756, Default Helper Manager ☆12 · May 10, 2025 · Updated last year
- A list of curated OpenSearch links ☆11 · May 17, 2024 · Updated 2 years ago