puppetm4st3r / baai_m3_simple_server
This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking tasks using the BAAI M3 multilingual model.
☆47Updated 4 months ago
Related projects: ⓘ
- A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.☆142Updated 2 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆171Updated 3 weeks ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆219Updated 2 weeks ago
- Open Source Text Embedding Models with OpenAI Compatible API☆124Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆127Updated 2 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆298Updated last week
- ☆160Updated 2 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆227Updated last week
- ☆42Updated 5 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆117Updated this week
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆59Updated 9 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆117Updated 3 weeks ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆379Updated this week
- An Open Source Toolkit For LLM Distillation☆284Updated last month
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆136Updated 3 weeks ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation☆157Updated 3 weeks ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated 3 weeks ago
- ☆293Updated 9 months ago
- ☆73Updated 8 months ago
- Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.☆274Updated 7 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆98Updated 2 weeks ago
- Official repo for "Make Your LLM Fully Utilize the Context"☆239Updated 4 months ago
- This repository implements the chain of verification paper by Meta AI☆151Updated 11 months ago
- ☆109Updated last month
- Fine-Tuning Embedding for RAG with Synthetic Data☆456Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆101Updated last week
- A Python library to chunk/group your texts based on semantic similarity.☆77Updated 2 months ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆111Updated 5 months ago
- ☆236Updated 2 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆120Updated 8 months ago