rag-wtf / open-text-embeddingsLinks
Open Source Text Embedding Models with OpenAI Compatible API
☆157Updated last year
Alternatives and similar repositories for open-text-embeddings
Users that are interested in open-text-embeddings are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated last year
- A tool for generating function arguments and choosing what function to call with local LLMs☆427Updated last year
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated last year
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆287Updated last month
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆240Updated last year
- This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking task…☆69Updated last year
- ☆231Updated last month
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆153Updated 10 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated last month
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)☆275Updated last year
- Langport is a language model inference service☆93Updated 10 months ago
- Clone of https://r.jina.ai which is deployable locally☆47Updated 10 months ago
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆260Updated last month
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆24Updated 11 months ago
- Finetune ALL LLMs with ALL Adapeters on ALL Platforms!☆325Updated 2 weeks ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆316Updated last year
- AI for all: Build the large graph of the language models☆272Updated last year
- A simple, easy-to-hack Vector Database☆156Updated 8 months ago
- Sentence Transformers API: An OpenAI compatible embedding API server☆64Updated 11 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- 👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.☆351Updated 6 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆342Updated this week
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆69Updated last year
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆187Updated last year
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆235Updated 11 months ago
- Local LLM ReAct Agent with Guidance☆158Updated 2 years ago
- A OpenAI API compatible REST server for llama.☆208Updated 5 months ago
- Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.☆390Updated last year
- Code for explaining and evaluating late chunking (chunked pooling)☆428Updated 7 months ago
- ☆64Updated last year