runpod-workers / worker-infinity-embeddingLinks
Create embeddings with infinity as serverless endpoint
☆34Updated last month
Alternatives and similar repositories for worker-infinity-embedding
Users that are interested in worker-infinity-embedding are comparing it to the libraries listed below
Sorting:
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆108Updated 3 months ago
- ☆66Updated last year
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated last year
- ☆40Updated 2 months ago
- ☆95Updated last month
- Run GPU inference and training jobs on serverless infrastructure that scales with you.☆102Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆62Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated 3 weeks ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆36Updated last year
- RAG example using DSPy, Gradio, FastAPI☆83Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated 2 years ago
- ⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol☆147Updated 9 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆105Updated 2 weeks ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Tutorial to get started with SkyPilot!☆58Updated last year
- A starter app to build AI powered chat bots with Astra DB and LlamaIndex☆74Updated last year
- Embed anything.☆28Updated last year
- Python client library for improving your LLM app accuracy☆98Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago
- Analyzing chat interactions w/ LLMs to improve 🦜🔗 Langchain docs☆79Updated last year
- Own your AI, search the web with it🌐😎☆86Updated 6 months ago
- Examples of models deployable with Truss☆189Updated this week
- Build reliable, secure, and production-ready AI apps easily.☆74Updated this week
- Simple Graph Memory for AI applications☆88Updated 2 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆82Updated 4 months ago
- ☆115Updated 6 months ago
- Function Calling Benchmark & Testing☆87Updated last year
- Anthropic Computer Use with Modal Sandboxes☆37Updated 8 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 11 months ago