Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with BM25 for accurate document retrieval. It parses PDFs, chunks content contextually, and enhances search precision with AI-powered contextual understanding and re-ranking.
☆50Oct 13, 2024Updated last year
Alternatives and similar repositories for contextual-doc-retrieval-opneai-reranker
Users that are interested in contextual-doc-retrieval-opneai-reranker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆47Oct 2, 2024Updated last year
- This experimental tool leverages Google's Gemini 2.5 Flash Preview model to parse complex tables from PDF documents and convert them into…☆15May 16, 2025Updated 10 months ago
- A Retrieval-Augmented Generation (RAG) system running DeepSeek R1 Distill LLama 70B model using Groq's fast inference API.☆13Jan 29, 2025Updated last year
- Contextual Retrieval solves this problem by prepending chunk-specific explanatory context to each chunk before embedding (“Contextual Emb…☆28Sep 29, 2024Updated last year
- Automate web interactions using LLMs and Llama-Index AgentWorkflow☆16Jan 25, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An implementation of an iterative knowledge base search agent using Agno agent framework, inspired by Ashpreet DeepKnowledge concept usin…☆15Feb 6, 2025Updated last year
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM.☆19Dec 20, 2024Updated last year
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o…☆65Dec 19, 2024Updated last year
- RFP-Response Analyzer is a Flask web app that uses AI to analyze and compare RFP documents with their responses, providing insights, gap …☆32Jan 2, 2025Updated last year
- ☆30Oct 4, 2024Updated last year
- Yaraa (Yet Another Rag Automation Attempt) is a library that tackles the boring aspects of managing Rag pipelines, so you don't have to.☆26Sep 5, 2024Updated last year
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆86Oct 2, 2024Updated last year
- A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evo…☆1,713Jan 15, 2026Updated 2 months ago
- Code/data for MARG (multi-agent review generation)☆63Mar 5, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi…☆25Jun 7, 2025Updated 10 months ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB).☆35Feb 24, 2026Updated last month
- Document Q& A using RAG - GEMINI PRO☆40Feb 13, 2024Updated 2 years ago
- ☆10Oct 19, 2023Updated 2 years ago
- Qualitative Data Analysis done by AI (or LLMs). 🖥️ Streamlit & 🔗 Langchain☆50Jan 20, 2025Updated last year
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- Enable real file upload for ChatGPT (o1, o1-pro, etc ...)☆29Apr 12, 2025Updated 11 months ago
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- Recursive Self-Aggregation evals on ARC-AGI☆29Jan 26, 2026Updated 2 months ago
- Data Structures in Python☆10Updated this week
- Let ChatGPT find you proactively☆12Apr 15, 2023Updated 2 years ago
- Sample retrieval-augmented generation (RAG) app template using Azure AI Search, Azure OpenAI, and Vercel AI SDK☆43Aug 14, 2025Updated 7 months ago
- Collaborative AI Model☆11Nov 27, 2024Updated last year
- Host LLM via text-generation-inference☆16Dec 5, 2023Updated 2 years ago
- ☆11May 2, 2022Updated 3 years ago
- Python code which creates a semantic search bot over any available corpus☆17May 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆10Jan 24, 2021Updated 5 years ago
- Extract information from Glimmer components to generate documentation using typescript parser/checker☆14Mar 19, 2024Updated 2 years ago
- LLM Applications built using Streamlit, LangChain, and OpenAI API☆11Oct 7, 2023Updated 2 years ago
- A new business pipeline architecture for Ruby/Rails applications☆14Aug 2, 2024Updated last year
- Implementation of RFC 756, Default Helper Manager☆12May 10, 2025Updated 11 months ago
- A list of curated OpenSearch links☆11May 17, 2024Updated last year
- In this project, we have to create a predictive model which allows the company to maximize the profit of the next marketing campaign☆14Oct 18, 2025Updated 5 months ago