Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with BM25 for accurate document retrieval. It parses PDFs, chunks content contextually, and enhances search precision with AI-powered contextual understanding and re-ranking.
☆49Oct 13, 2024Updated last year
Alternatives and similar repositories for contextual-doc-retrieval-opneai-reranker
Users that are interested in contextual-doc-retrieval-opneai-reranker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆49Oct 2, 2024Updated last year
- This experimental tool leverages Google's Gemini 2.5 Flash Preview model to parse complex tables from PDF documents and convert them into…☆15May 16, 2025Updated last year
- A Retrieval-Augmented Generation (RAG) system running DeepSeek R1 Distill LLama 70B model using Groq's fast inference API.☆13Jan 29, 2025Updated last year
- Contextual Retrieval solves this problem by prepending chunk-specific explanatory context to each chunk before embedding (“Contextual Emb…☆28Sep 29, 2024Updated last year
- Automate web interactions using LLMs and Llama-Index AgentWorkflow☆16Jan 25, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An implementation of an iterative knowledge base search agent using Agno agent framework, inspired by Ashpreet DeepKnowledge concept usin…☆15Feb 6, 2025Updated last year
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o…☆67Dec 19, 2024Updated last year
- ☆30Oct 4, 2024Updated last year
- RFP-Response Analyzer is a Flask web app that uses AI to analyze and compare RFP documents with their responses, providing insights, gap …☆33Jan 2, 2025Updated last year
- Ironpad is a local-first, file-based project management system I've been building with AI. Rust backend (Axum), Vue 3 frontend, markdown …☆58Feb 27, 2026Updated 3 months ago
- Yaraa (Yet Another Rag Automation Attempt) is a library that tackles the boring aspects of managing Rag pipelines, so you don't have to.☆26Sep 5, 2024Updated last year
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆86Oct 2, 2024Updated last year
- A showcase of Phi Agent capabilities featuring a Streamlit-powered Research and Finance AI assistant built with Meta Llama 3.3 70B Instru…☆13Dec 7, 2024Updated last year
- Code/data for MARG (multi-agent review generation)☆64Mar 5, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB).☆39Feb 24, 2026Updated 3 months ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆52Dec 30, 2024Updated last year
- ☆18Jan 10, 2025Updated last year
- ☆12Dec 11, 2022Updated 3 years ago
- ☆10Oct 19, 2023Updated 2 years ago
- Asynchronous iOS audio recording library designed for real-time speech audio processing☆17Oct 24, 2024Updated last year
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- Enable real file upload for ChatGPT (o1, o1-pro, etc ...)☆28Apr 12, 2025Updated last year
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Apr 11, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 4 months ago
- Collaborative AI Model☆11Nov 27, 2024Updated last year
- Host LLM via text-generation-inference☆16Dec 5, 2023Updated 2 years ago
- ☆11Feb 1, 2021Updated 5 years ago
- Documentation source for docs.drawthings.ai☆24Apr 26, 2024Updated 2 years ago
- ☆18Dec 28, 2024Updated last year
- Adds immersive background music and ambient sounds to your chats.☆17Jun 14, 2025Updated 11 months ago
- ☆11May 2, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 8 months ago
- Command to generate a route map of your Ember application.☆15Jan 28, 2020Updated 6 years ago
- AI-powered tweet optimization tool using DSPy with hill-climbing algorithm☆28Oct 15, 2025Updated 7 months ago
- Extract information from Glimmer components to generate documentation using typescript parser/checker☆14Mar 19, 2024Updated 2 years ago
- A new business pipeline architecture for Ruby/Rails applications☆14Aug 2, 2024Updated last year
- Implementation of RFC 756, Default Helper Manager☆12May 10, 2025Updated last year
- Examples for the Activate conference☆11Sep 11, 2019Updated 6 years ago