agamm / semantic-splitLinks
A Python library to chunk/group your texts based on semantic similarity.
☆97Updated last year
Alternatives and similar repositories for semantic-split
Users that are interested in semantic-split are comparing it to the libraries listed below
Sorting:
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆179Updated last year
- ☆237Updated 3 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆113Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆161Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆119Updated last week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆147Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- Lightweight Non-Parametric Embedding Fine-Tuning☆36Updated 3 weeks ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆243Updated 2 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆191Updated last year
- ☆194Updated 3 weeks ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆180Updated last year
- Python API for https://vespa.ai, the open big data serving engine☆143Updated last week
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆70Updated 9 months ago
- Evaluation of bm42 sparse indexing algorithm☆68Updated last year
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆375Updated last month
- ☆77Updated 8 months ago
- Generalist and Lightweight Model for Text Classification☆162Updated 3 months ago
- This repository implements the chain of verification paper by Meta AI☆177Updated last year
- ☆124Updated 7 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆438Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 11 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆189Updated last year
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆228Updated 3 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆147Updated last year
- ☆62Updated last year
- ☆146Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆289Updated 11 months ago