limcheekin / open-text-embeddingsLinks
Open Source Text Embedding Models with OpenAI Compatible API
☆153Updated 10 months ago
Alternatives and similar repositories for open-text-embeddings
Users that are interested in open-text-embeddings are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆130Updated 11 months ago
- This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking task…☆69Updated last year
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated 10 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆150Updated 7 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆284Updated last month
- A tool for generating function arguments and choosing what function to call with local LLMs☆427Updated last year
- Clone of https://r.jina.ai which is deployable locally☆44Updated 8 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆318Updated 3 weeks ago
- ☆157Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 5 months ago
- A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Suppo…☆93Updated 2 years ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆86Updated last week
- Sentence Transformers API: An OpenAI compatible embedding API server☆59Updated 9 months ago
- OpenAI compatible API for TensorRT LLM triton backend☆208Updated 10 months ago
- fastertransformer for codegeex model☆63Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆172Updated 8 months ago
- ☆225Updated 5 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆231Updated 9 months ago
- Local LLM ReAct Agent with Guidance☆158Updated 2 years ago
- Docker compose to run vLLM on Windows☆81Updated last year
- ☆313Updated last year
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆24Updated 9 months ago
- Benchmarking the serving capabilities of vLLM☆45Updated 9 months ago
- Langport is a language model inference service☆94Updated 8 months ago
- ☆74Updated last year
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated last year
- Visual Studio Code extension for WizardCoder☆147Updated last year
- ☆59Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆67Updated last year
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆248Updated 3 weeks ago