agamm / semantic-splitLinks

A Python library to chunk/group your texts based on semantic similarity.

☆97

Alternatives and similar repositories for semantic-split

Users that are interested in semantic-split are comparing it to the libraries listed below

Sorting:

stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 10 months ago
predlico / ARAGOG
ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…
☆108Updated last year
Unstructured-IO / unstructured-inference
☆189Updated last month
aurelio-labs / semantic-chunkers
☆231Updated last month
zetaalphavector / RAGElo
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
☆114Updated 3 weeks ago
dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆128Updated last year
chentong0 / factoid-wiki
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆159Updated last year
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆222Updated 2 weeks ago
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆148Updated last month
vespa-engine / pyvespa
Python API for https://vespa.ai, the open big data serving engine
☆133Updated this week
CYQIQ / MultiCoT
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
☆146Updated last year
chrisammon3000 / dspy-neo4j-knowledge-graph
LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.
☆187Updated last year
urchade / GraphER
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
☆76Updated last year
KarelDO / xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆434Updated last year
YZ-Cai / SimGRAG
Official code of the ACL 2025 paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"
☆118Updated last week
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆142Updated last year
plaggy / rag-containers
Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.
☆68Updated 7 months ago
aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆99Updated 9 months ago
ritun16 / chain-of-verification
This repository implements the chain of verification paper by Meta AI
☆172Updated last year
isaacus-dev / semchunk
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
☆347Updated last month
szeighami / nudge
Lightweight Non-Parametric Embedding Fine-Tuning
☆28Updated 10 months ago
spcl / MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆222Updated last month
khaimt / qa_expert
This repo is for handling Question Answering, especially for Multi-hop Question Answering
☆67Updated last year
docugami / KG-RAG-datasets
Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets
☆167Updated last year
henrikalbihn / gliner-as-a-service
GLiNER model in a FastAPI microservice.
☆45Updated 7 months ago
apple / ml-superposition-prompting
☆145Updated last year
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆236Updated 11 months ago
flairNLP / fabricator
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
☆108Updated last year
hyintell / RetrievalQA
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆66Updated last year
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 6 months ago