Vokturz / fast-embeddings-apiLinks
fast-embeddings-api
☆15Updated 2 years ago
Alternatives and similar repositories for fast-embeddings-api
Users that are interested in fast-embeddings-api are comparing it to the libraries listed below
Sorting:
- Universal text classifier for generative models☆25Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 3 months ago
- ☆53Updated 10 months ago
- ☆14Updated last year
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆46Updated last year
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.☆37Updated 2 years ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆44Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆119Updated this week
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆28Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Evaluation of bm42 sparse indexing algorithm☆72Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆164Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆73Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated last month
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆33Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆49Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 3 months ago
- PyLate efficient inference engine☆68Updated 3 months ago
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated last month
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆45Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆85Updated 11 months ago
- Voyage AI Official Python Library☆83Updated last week
- ☆36Updated last year
- ☆85Updated 2 years ago