jina-ai / late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
☆246 · Updated last month
Related projects
Alternatives and complementary repositories for late-chunking
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆192 · Updated 2 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆175 · Updated 2 weeks ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation ☆215 · Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆254 · Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆328 · Updated 5 months ago
- Automated Evaluation of RAG Systems ☆484 · Updated 2 weeks ago
- ☆251 · Updated 4 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning. ☆504 · Updated this week
- This is a repository of RALM surveys containing a summary of state-of-the-art RAG and other technologies ☆187 · Updated 5 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception ☆61 · Updated last week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA ☆412 · Updated last month
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024) ☆206 · Updated this week
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆138 · Updated last week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆133 · Updated last month
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆126 · Updated 5 months ago
- ☆131 · Updated 4 months ago
- RAGChecker: A Fine-grained Framework For Diagnosing RAG ☆542 · Updated last month
- Evaluation tools for Retrieval-augmented Generation (RAG) methods. ☆134 · Updated this week
- ☆180 · Updated last week
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆349 · Updated last week
- awesome synthetic (text) datasets ☆242 · Updated 3 weeks ago
- AWM: Agent Workflow Memory ☆205 · Updated last month
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey. ☆100 · Updated last month
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy. ☆264 · Updated 2 weeks ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j. ☆154 · Updated 7 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ☆242 · Updated 6 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆96 · Updated 7 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆356 · Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆180 · Updated 3 weeks ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆106 · Updated last month