dcarpintero / wikisearch
Multilingual Semantic Search with Reranking on a prepared large vectorized dataset comprising 10 million Wikipedia documents. It supports dense retrieval, keyword search, and hybrid search.
☆15 Updated 2 years ago
Alternatives and similar repositories for wikisearch
Users who are interested in wikisearch are comparing it to the repositories listed below.
- LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation. ☆45 Updated 2 years ago
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph ☆57 Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆106 Updated last year
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A. ☆58 Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆73 Updated last year
- Notebooks for ThirdAI demos ☆79 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆69 Updated 2 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆84 Updated last year
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated last year
- Explore new advancements like ChatGPT’s function calling capability, and build a conversational agent using a new syntax called LangChain… ☆15 Updated 2 years ago
- Split and analyze text files using langchain and streamlit ☆50 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆184 Updated last year
- Universal text classifier for generative models ☆24 Updated last year
- Examples using the Deep Search functionalities ☆83 Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on… ☆49 Updated last year
- GPT-4V(ision) module for use with Autodistill. ☆25 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆51 Updated last year
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚 ☆198 Updated last week
- ☆104 Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆137 Updated 2 years ago
- Platform for building an RDF Knowledge Graph from text sources ☆57 Updated 3 years ago
- ☆60 Updated 2 years ago
- ☆17 Updated last year
- MTEB: Massive Text Embedding Benchmark French extended ☆19 Updated last year
- An LLM training library for instruction-tuning. ☆26 Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models" ☆49 Updated 10 months ago
- OpenAI document chatbot using llama-index, pinecone and chainlit. With incremental features, giving you the tools to go from a basic RAG … ☆80 Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated last year
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds ☆140 Updated 3 weeks ago
- Query, ask and chat with a document-index via transformer models! ☆17 Updated 2 years ago