runpod-workers / worker-infinity-embedding
Create embeddings with Infinity as a serverless endpoint
☆41 Updated 6 months ago
Alternatives and similar repositories for worker-infinity-embedding
Users who are interested in worker-infinity-embedding are comparing it to the libraries listed below
- Examples of models deployable with Truss ☆208 Updated last week
- ☆40 Updated 6 months ago
- Run GPU inference and training jobs on serverless infrastructure that scales with you. ☆102 Updated last year
- A function to do all ☆35 Updated last year
- Open-source RAG evaluation through users' feedback ☆207 Updated last year
- 🌸 The open framework for question answering fine-tuning LLMs on private data ☆69 Updated 2 years ago
- ☆47 Updated last year
- ☆18 Updated 11 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆163 Updated 3 weeks ago
- faster-whisper as serverless endpoint ☆125 Updated 6 months ago
- RAG example using DSPy, Gradio, FastAPI ☆86 Updated last year
- ☆120 Updated last month
- A toolkit for building computer use AI agents ☆178 Updated 4 months ago
- The Official Python Client for Together's API ☆77 Updated this week
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks. ☆128 Updated 3 weeks ago
- AI Assistant that can get stock prices ☆46 Updated last year
- LLM Agents: Landing Page Generation for an E-commerce Platform Using CrewAI, Groq-LangChain and Qdrant ☆14 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆88 Updated this week
- A couple scripts to grab stats from email ☆43 Updated last year
- 🪢 Langfuse documentation -- Langfuse is the open source LLM Engineering Platform. Observability, evals, prompt management, playground an… ☆144 Updated this week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆113 Updated 7 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆51 Updated last year
- Groq MCP server ☆32 Updated this week
- Globally distributed compute in the cloud built for production. ☆41 Updated last week
- Data Questionnaire Agent Chatbot ☆69 Updated last month
- ☆116 Updated 11 months ago
- Democratizing access to LLMs for the open-source community. Let's advance AI, together. ☆29 Updated 2 years ago
- Own your AI, search the web with it🌐😎 ☆92 Updated 10 months ago
- ☆11 Updated last year
- Tutorial for building LLM router ☆235 Updated last year