PrithivirajDamodaran / blitz-embedLinks

C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.

☆22

Alternatives and similar repositories for blitz-embed

Users that are interested in blitz-embed are comparing it to the libraries listed below

Sorting:

AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65Updated last year
michaelfeil / embed
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆45Updated 10 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
zetaalphavector / RAGElo
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
☆114Updated 3 weeks ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆76Updated 9 months ago
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
PrithivirajDamodaran / SPLADERunner
Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆32Updated 11 months ago
davanstrien / data-for-fine-tuning-llms
☆79Updated last year
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆101Updated last year
vespa-engine / pyvespa
Python API for https://vespa.ai, the open big data serving engine
☆133Updated last week
unicamp-dl / InRanker
☆48Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
mickymultani / RAG-with-Cross-Encoder-Reranker
Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.
☆48Updated last year
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆85Updated 2 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 9 months ago
cohere-ai / BinaryVectorDB
Efficient vector database for hundred millions of embeddings.
☆207Updated last year
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆88Updated last year
cohere-ai / DiskVectorIndex
☆211Updated last month
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆62Updated 2 months ago
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 10 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year