kevaldekivadiya2415 / textembedLinks

TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding models and frameworks, making it ideal for various natural language processing applications.

☆25

Alternatives and similar repositories for textembed

Users that are interested in textembed are comparing it to the libraries listed below

Sorting:

etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆44Updated last year
HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆94Updated 4 months ago
jina-ai / submodular-optimization
Submodular optimization for context engineering: query fan-out, text selection, passage reranking
☆75Updated 3 months ago
DunZhang / Stella
☆62Updated last year
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆149Updated last year
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆70Updated last year
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆240Updated last year
denser-org / denser-retriever
An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.
☆290Updated 4 months ago
plaggy / rag-containers
Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.
☆73Updated 10 months ago
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 9 months ago
NVIDIA / workbench-llamafactory
This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.
☆67Updated last year
puppetm4st3r / baai_m3_simple_server
This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking task…
☆70Updated last year
SalesforceAIResearch / SFR-RAG
☆79Updated 9 months ago
milvus-io / milvus-model
A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.
☆57Updated 7 months ago
ibm-self-serve-assets / Blended-RAG
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers
☆75Updated 5 months ago
Yannael / multilingual-embeddings
☆65Updated last year
asprenger / ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
☆75Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆140Updated last year
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆179Updated last year
TebooNok / HiQA
Code implement reposity of Paper HiQA
☆103Updated 7 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78Updated last year
EasyShopAI / rag-lab
Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot
☆46Updated 4 months ago
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆240Updated last year
hyintell / RetrievalQA
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆68Updated last year
spcl / MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆230Updated 3 weeks ago
gomate-community / rageval
Evaluation tools for Retrieval-augmented Generation (RAG) methods.
☆166Updated 11 months ago
AlexBodner / How_Much_VRAM
☆102Updated last year
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆91Updated last month
wenqiglantz / nvidia-sec-finetuning
Fine-Tuning LLM and embedding models
☆28Updated 2 years ago
Yangjiaxi / Sense
[ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"
☆68Updated last year