limcheekin / open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
☆152Updated 9 months ago
Alternatives and similar repositories for open-text-embeddings:
Users that are interested in open-text-embeddings are comparing it to the libraries listed below
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated 10 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated 9 months ago
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆238Updated last year
- This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking task…☆64Updated 11 months ago
- Comparison of Language Model Inference Engines☆214Updated 4 months ago
- Code implement reposity of Paper HiQA☆100Updated last month
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆284Updated last week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆307Updated this week
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆147Updated 6 months ago
- OpenAI compatible API for TensorRT LLM triton backend☆205Updated 8 months ago
- A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Suppo…☆92Updated last year
- Examples on how to use LangChain and Ray☆227Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆135Updated 4 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆65Updated last year
- Local LLM ReAct Agent with Guidance☆158Updated last year
- AI for all: Build the large graph of the language models☆263Updated 10 months ago
- fastertransformer for codegeex model☆63Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated last week
- Multimodal LLM Application with PyMuPDF4LLM☆36Updated 6 months ago
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆23Updated 7 months ago
- A tool for generating function arguments and choosing what function to call with local LLMs☆424Updated last year
- Tutorials from AutoGen Basics to Use Cases☆29Updated last year
- A fast batching API to serve LLM models☆182Updated 11 months ago
- Langport is a language model inference service☆94Updated 7 months ago
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)☆276Updated last year
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆239Updated 9 months ago
- Clone of https://r.jina.ai which is deployable locally☆44Updated 7 months ago
- ☆51Updated 9 months ago
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆225Updated 7 months ago
- Sentence Transformers API: An OpenAI compatible embedding API server☆54Updated 7 months ago