runpod-workers / worker-infinity-embedding
☆25Updated 2 weeks ago
Alternatives and similar repositories for worker-infinity-embedding:
Users that are interested in worker-infinity-embedding are comparing it to the libraries listed below
- ☆38Updated 4 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆31Updated 11 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 3 months ago
- ☆30Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆74Updated last year
- Using modal.com to process FineWeb-edu data☆19Updated last month
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 11 months ago
- A framework for evaluating function calls made by LLMs☆36Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆55Updated 11 months ago
- Plug n Play GBNF Compiler for llama.cpp☆23Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- Data extraction with LLM on CPU☆68Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆43Updated 3 weeks ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆275Updated last week
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆58Updated 6 months ago
- Agent computer interface for AI software engineer.☆29Updated this week
- Self-host LLMs with vLLM and BentoML☆81Updated this week
- Run embedding models using ONNX☆29Updated last year
- Generates grammer files from typescript for LLM generation☆36Updated 11 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆96Updated 3 months ago
- Unsloth Studio☆51Updated 3 months ago
- Embed anything.☆29Updated 8 months ago
- Build reliable, secure, and production-ready AI apps easily.☆57Updated last week
- Vector Database with support for late interaction and token level embeddings.☆52Updated 4 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆41Updated this week
- ☆24Updated last year
- ☆4Updated 5 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year