runpod-workers / worker-infinity-embeddingLinks

Create embeddings with infinity as serverless endpoint

☆34

Alternatives and similar repositories for worker-infinity-embedding

Users that are interested in worker-infinity-embedding are comparing it to the libraries listed below

Sorting:

runpod-workers / worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
☆342Updated this week
AstraBert / PrAIvateSearch
Own your AI, search the web with it🌐😎
☆87Updated 6 months ago
basetenlabs / truss-examples
Examples of models deployable with Truss
☆192Updated last week
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆139Updated last week
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆187Updated 10 months ago
anyscale / llm-router
Tutorial for building LLM router
☆221Updated last year
runpod / runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
☆244Updated this week
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111Updated 3 months ago
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
unslothai / unsloth-studio
Unsloth Studio
☆98Updated 4 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
andreasjansson / AutoCog
☆40Updated 2 months ago
BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆150Updated 6 months ago
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 10 months ago
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆250Updated 5 months ago
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆120Updated last week
Itachi-Uchiha581 / Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
☆102Updated 9 months ago
jina-ai / rungpt
An open-source cloud-native of large multi-modal models (LMMs) serving framework.
☆167Updated last year
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
TrelisResearch / one-click-llms
One click templates for inferencing Language Models
☆203Updated this week
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆88Updated last year
cohere-ai / cohere-terrarium
A simple Python sandbox for helpful LLM data agents
☆277Updated last year
jquesnelle / literAI
Generate visual podcasts about novels using open source models
☆25Updated 2 years ago
neoxelox / dspy-inspector
DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.
☆37Updated last year
n4ze3m / vexasearch
A function to do all
☆35Updated last year
modal-labs / awesome-modal
A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.
☆150Updated last month
bodaay / HuggingChatAllInOne
One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd
☆26Updated 2 years ago
agentsea / surfkit
A toolkit for building computer use AI agents
☆170Updated last month
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago