agamm / semantic-split
A Python library to chunk/group your texts based on semantic similarity.
☆95Updated 9 months ago
Alternatives and similar repositories for semantic-split:
Users that are interested in semantic-split are comparing it to the libraries listed below
- ☆178Updated this week
- ☆217Updated 4 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆124Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆162Updated 6 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆174Updated 7 months ago
- Python API for https://vespa.ai, the open big data serving engine☆120Updated last week
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆101Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆155Updated last year
- DSPY on action with OpenSource LLMs.☆70Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆48Updated 6 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆195Updated 6 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated this week
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆290Updated 3 weeks ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- ☆120Updated last month
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆194Updated last week
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆177Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆106Updated 7 months ago
- This repository implements the chain of verification paper by Meta AI☆168Updated last year
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆284Updated this week
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆205Updated 5 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆108Updated this week
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆420Updated last year
- ☆74Updated 3 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆230Updated 7 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆435Updated 3 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- Automated knowledge graph creation SDK☆120Updated 4 months ago
- Create a knowledge graph out of unstructed legal text - use said knowledge graph in a graph augmented retrieval augmented generation pipe…☆41Updated 6 months ago