runpod-workers / worker-infinity-embeddingLinks
Create embeddings with infinity as serverless endpoint
β34Updated 2 months ago
Alternatives and similar repositories for worker-infinity-embedding
Users that are interested in worker-infinity-embedding are comparing it to the libraries listed below
Sorting:
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.β342Updated this week
- Own your AI, search the web with itππβ87Updated 6 months ago
- Examples of models deployable with Trussβ192Updated last week
- Self-host LLMs with vLLM and BentoMLβ139Updated last week
- Routing on Random Forest (RoRF)β187Updated 10 months ago
- Tutorial for building LLM routerβ221Updated last year
- π | Python library for RunPod API and serverless worker SDK.β244Updated this week
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMTβ27Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ111Updated 3 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platformβ87Updated last month
- Unsloth Studioβ98Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 9 months ago
- β40Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets!β150Updated 6 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busineβ¦β151Updated 10 months ago
- A Lightweight Library for AI Observabilityβ250Updated 5 months ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.β120Updated last week
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).β102Updated 9 months ago
- An open-source cloud-native of large multi-modal models (LMMs) serving framework.β167Updated last year
- A framework for evaluating function calls made by LLMsβ37Updated last year
- One click templates for inferencing Language Modelsβ203Updated this week
- Function Calling Benchmark & Testingβ88Updated last year
- A simple Python sandbox for helpful LLM data agentsβ277Updated last year
- Generate visual podcasts about novels using open source modelsβ25Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β37Updated last year
- A function to do allβ35Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.β150Updated last month
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEndβ26Updated 2 years ago
- A toolkit for building computer use AI agentsβ170Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)β91Updated 6 months ago