Vokturz / fast-embeddings-apiLinks

fast-embeddings-api

☆14

Alternatives and similar repositories for fast-embeddings-api

Users that are interested in fast-embeddings-api are comparing it to the libraries listed below

Sorting:

Knowledgator / unlimited_classifier
Universal text classifier for generative models
☆24Updated 10 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆43Updated 4 months ago
etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆42Updated 10 months ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated last month
PrithivirajDamodaran / blitz-embed
C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…
☆22Updated last year
iulia-b10 / query_transformations
☆14Updated last year
pygongnlp / CoSearchAgent
[SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models
☆24Updated last year
michaelfeil / embed
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆45Updated 8 months ago
dmarx / zero-shot-intent-classifier
Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.
☆33Updated 2 years ago
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆40Updated last year
kevaldekivadiya2415 / textembed
TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…
☆24Updated 9 months ago
ianhohoho / auto-hyde
🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…
☆32Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 7 months ago
appier-research / structure-gen
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models
☆21Updated last week
Knowledgator / LiqFit
Efficient few-shot learning with cross-encoders.
☆52Updated last year
oceanumeric / EnteRAG
A RAG that can scale 🧑🏻‍💻
☆11Updated last year
wenqiglantz / nvidia-sec-finetuning
Fine-Tuning LLM and embedding models
☆27Updated last year
geronimi73 / 3090_shorts
minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever
☆39Updated 2 weeks ago
plaggy / rag-containers
Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.
☆65Updated 5 months ago
tcapelle / mixtral
Mixtral finetuning
☆19Updated last year
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆70Updated 7 months ago
huggingface / screensuite
ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!
☆34Updated this week
pampa-labs / llmate
☆10Updated last year
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆48Updated last year
nbroad1881 / strideformer
Using short models to classify long texts
☆21Updated 2 years ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆56Updated 3 weeks ago
davidberenstein1957 / dataset-viber
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
☆47Updated 9 months ago
LLukas22 / llm-rs-python
Unofficial python bindings for the rust llm library. 🐍❤️🦀
☆75Updated last year
vespa-engine / pyvespa
Python API for https://vespa.ai, the open big data serving engine
☆126Updated this week
rmartinshort / text_chunking
Exploration of semantic chunking and chunk classification
☆13Updated 8 months ago