dwarvesf / llm-hostingLinks
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β23Updated 3 months ago
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below
Sorting:
- Chrome Extension for exploring Hugging Face datasets πβ50Updated 9 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated 7 months ago
- Create embeddings for LLM using the Nomic APIβ23Updated 7 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- Structured outputs from DSPy and Jinja2β23Updated last month
- The official Python library for Formulaicβ16Updated last year
- Tools for formatting large language model prompts.β13Updated last year
- A text-to-SQL prototype on the northwind sqlite datasetβ12Updated 9 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!β17Updated last month
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1β¦β14Updated last year
- Fetch message history from discord for LLMsβ15Updated 3 weeks ago
- Python package for extractive NLP using the OpenAI APIβ17Updated 9 months ago
- π€οΈ Pathik - High-Performance Web Crawler β‘β26Updated 2 months ago
- GraphRag vs Embeddingsβ14Updated 11 months ago
- Web Interface for Vision Language Models Including InternVLM2β22Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!β48Updated last year
- applications of https://github.com/PrefectHQ/marvinβ12Updated last year
- Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Huβ¦β20Updated 3 weeks ago
- Embedding models from Jina AIβ60Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β35Updated last year
- LLM plugin for models hosted by Anyscale Endpointsβ33Updated last year
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, β¦β35Updated 10 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.β26Updated 3 months ago
- β20Updated last year
- Code interpreter support for o1β32Updated 9 months ago
- convert natural language into technical diagramsβ14Updated 6 months ago
- LLM code editor for backend servicesβ14Updated 8 months ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollamaβ10Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 7 months ago
- Apps that run on modal.comβ12Updated last year