rmartinshort / text_chunkingLinks

Exploration of semantic chunking and chunk classification

☆16

Alternatives and similar repositories for text_chunking

Users that are interested in text_chunking are comparing it to the libraries listed below

Sorting:

padas-lab-de / ir-rag-sigir24-persona-rag
☆47Updated 10 months ago
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆45Updated last year
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50Updated last year
ianhohoho / auto-hyde
🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…
☆32Updated last year
yip-kl / llm_dspy_tutorial
Tutorial for DSPy
☆23Updated last year
davanstrien / data-for-fine-tuning-llms
☆79Updated last year
skandavivek / DSPy-blog
A tutorial on DSPy and whether automated prompt engineering lives up to the hype
☆24Updated last year
jacoblee93 / oss-model-extraction-evals
☆31Updated last year
miralab-ai / autoreason
☆40Updated 7 months ago
jjovalle99 / agentic-design-patterns
☆14Updated last year
intellectronica / battle-of-the-semantics
GraphRag vs Embeddings
☆15Updated last year
kbmurali / som-driven-qa-rag
Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…
☆16Updated last year
VianneyMI / baker
Baker is an AI powered app that helps you find recipes and avoid food waste
☆14Updated 7 months ago
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 9 months ago
boschresearch / switchprompt
Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…
☆52Updated 2 years ago
clab2024 / clab
LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…
☆16Updated last year
DerwenAI / textgraphs
TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph
☆24Updated last year
plastic-labs / dspy-opentom
Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset
☆17Updated last year
darshil3011 / AutoMetaRAG
Dynamic Metadata based RAG Framework
☆75Updated last year
jmanhype / Golden-Retriever
A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…
☆33Updated 11 months ago
mrmaheshrajput / productionizing-llms
Code Repository for Blog - How to Productionize Large Language Models (LLMs)
☆11Updated last year
tahreemrasul / semantic_research_engine
A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…
☆83Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111Updated 3 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
davidberenstein1957 / dataset-viber
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
☆47Updated 11 months ago
ali-bahrainian / RAG_best_practices
☆93Updated 4 months ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 10 months ago
kinivi / AlchemyLab
A place where I experiment with AI and share with a world
☆24Updated last year
yale-nlp / SciArena
Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"
☆45Updated last month
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆26Updated 7 months ago