dcarpintero / wikisearchLinks
Multilingual Semantic Search with Reranking on a prepared large vectorized dataset comprising 10 million Wikipedia documents. It supports dense retrieval, keyword search, and hybrid search.
☆15Updated 2 years ago
Alternatives and similar repositories for wikisearch
Users that are interested in wikisearch are comparing it to the libraries listed below
Sorting:
- LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.☆45Updated 2 years ago
- Notebooks for ThirdAI demos☆80Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- Create a knowledge graph out of unstructed legal text - use said knowledge graph in a graph augmented retrieval augmented generation pipe…☆63Updated last year
- Split and analyze text files using langchain and streamlit☆50Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 8 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆181Updated last year
- Universal text classifier for generative models☆25Updated last year
- What, Why and How of LLMs.☆75Updated 3 months ago
- ☆20Updated last year
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚☆195Updated last week
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆57Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- 🚀 A list of Haystack Integrations, maintained by the community or deepset.☆99Updated last week
- The long-term memory for your Superagents 🥷and LLMs 🤖. Built with GraphRAG, Knowledge graphs and autonomous ai agents☆63Updated 11 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated last month
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆45Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"☆48Updated 8 months ago
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆99Updated 2 years ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆82Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆87Updated last year
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆14Updated 3 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated 2 years ago
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆176Updated last week
- Explore the use of DSPy for extracting features from PDFs 🔎☆49Updated last year
- ☆15Updated last year
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆77Updated 8 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆48Updated last year